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A MEGAKARYOCYTOPOIETIC FACTOR 

The present invention relates generally to a 
novel protein factor which is important in regulating the 
human hematopoietic system. More specifically the 
invention discloses a novel protein factor that 
stimulates megakaryocyte colony formation and the 
differentiation or maturation of megakaryocyte 
progenitors. Also provided are processes for obtaining 
the factor in homogeneous form and producing it by 
recombinant genetic engineering techniques. 

Background of the Invention 

Megakaryocytes are the hematopoietic cells, 
largely found in the bone marrow, but also in peripheral 
blood and perhaps other tissues as well, which produce 
platelets (also known as thrombocytes) and subsequently 
release them into circulation. Megakaryocytes, like all 
of the hematopoietic cells of the human hematopoietic 
system, ultimately derive from a primitive stem cell 
after passing through a complex pathway comprising many 
cellular divisions and considerable differentiation and 
— maturation. — ~~ — — 
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The platelets derived from these megakaryocyte 

cells are critical for initiating blood clot formation at 

the site of injury. Platelets also release growth 

factors at the site of clot formation that speed the 

5 process of wound healing and may serve other functions. 

However, in patients suffering from depressed levels of 

platelets (thrombocytopenia) the inability to form clots 

is the most immediate and serious consequence, a 

potentially fatal complication of many therapies for 

10 cancer. Such cancer patients are generally treated for 

this problem with platelet transfusions. Other patients 
frequently requiring platelet transfusions are those 
undergoing bone marrow transplantation or patients with 
aplastic anemia. 

15 Platelets for such procedures are obtained by 

plateletp^oresis from normal donors. Like most human 
blood products, platelets for transfusion have a 
relatively short shelf-life and also expose the patients 
to considerable risk of exposure to dangerous viruses, 

20 such as the human immunodeficiency virus (HIV) . 

- Clearly the ability to stimulate endogenous 
platelet formation in thrombocytopenic patients with a 
concomitant reduction in their dependence on platelet 
transfusion would be of great benefit. In addition the 

25 ability to correct or prevent thrombocytopenia in 

patients undergoing radiation therapy or chemotherapy for 
cancer would make such treatments safer and possibly 
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permit increases in the intensity of the therapy thereby 
yielding greater anti-cancer effects. 

For these reasons considerable research has 
been devoted to the identification and purification of 
factors involved in the regulation of megakaryocyte and 
platelet production. Although there is considerable 
controversy, the factors regulating the growth and 
differentiation of hematopoietic cells into mature 
megakaryocytes and the subsequent production of platelets 
by these cells are believed to fall into two classes: 

(1) megakaryocyte colony-stimulating factors (meg-CSFs) 
which support the proliferation and differentiation of 
megakaryocytic progenitors in culture, and 

(2) thrombopoietic (TPO) factors which support the 
differentiation and maturation of megakaryocytes 
resulting in the production and release of platelets. 
[See, e.g., E. Mazur, Exp. Hematol . r 1^:340-350 (1987).] 

Either class of factors is defined by bioassay. 
Factors with meg-CSF activity support megakaryocyte 
colony formation, while factors with TPO activity elicit 
an elevation in the numbers of circulating platelets when 
administered to animals. It is not clear how many 
species of .factors exist _ thatrT*ave either or both of 
these activities. For example, human IL-3 supports human 
megakaryocyte colony formation and, at least in monkeys, 
also frequently elicits an elevation in platelet count. 
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However, IL-3 influences hematopoietic cell development 
in all of the hematopoietic lineages and can be 
distinguished from specific regulators of 
megakaryocytopoiesis and platelet formation which 
interact selectively with cells of the megakaryocytic 
lineage. 

From the studies reported to date, it is not 
clear whether activities identified as meg-CSF also have 
TPO activity or vice versa. Many different reports in 
the literature describe factors which interact with cells 
of the megakaryocytic lineage. Several putative meg-CSF 
compositions have been derived from serum [See, e.g., R. 
Hoffman et al, J. Clin. Invest. . 15:1174-1182 (1985) ; J. 
E. Straneva etal, Exp. Hematol. . 15:657-663 (1987); E. 
Mazur et al, Exp . Hematol . . 12:1164-1172 (1985]. A 
larger number of reports of a TPO factor are in the art. 
[See, e.g. , T. P. McDonald, Exp. Hematol. f 16:201-205 
(1988); T. P. McDonald et al, Biochem. Med. Metab. Biol. . 
37:335-343 (1987) ; T. Tayrien et al, J. Biol. Chem. . 262 : 
3262-3268 (1987) and others]. 

Although there have been numerous additional 
reports tentatively identifying such regulatory factors, 
the biochemical and biological identification and 
characterization of these factors has been hampered by 
the small quantities of the naturally occurring factors 
available from natural sources, e.g., blood and urine. 
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At present there is no identification of a single 
purified factor useful as a meg-cSF or TPO for 
pharmaceutical use in replacing serum-derived products 
or platelets. 

There remains a need in the art for additional 
proteins purified from their natural sources or otherwise 
produced in homogeneous form, which are capable of 
stimulating or enhancing the production of platelets in 
vivo, to replace presently employed platelet 
transfusions. 

Brief Summary of the Invention 

In one aspect the present invention provides a 
novel human megakaryocytopoietic factor (meg-CSF) which 
is substantially free from other human proteins. This 
protein may be produced by recombinant genetic 
engineering techniques. It may also be purified from 
cell sources producing the factor naturally or upon 
induction with other factors. meg-CSF may also be 
synthesized by chemical techniques, or a combination of 
the above-listed techniques. 

The meg-CSF of the present invention has been 
found to stimulate the growth and development of colonies- 
consisting of intermediate and large size megakaryocytes 
in an assay using murine bone marrow target cells. 
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meg-CSF displays biological activity in this assay of 
greater than 5X1 0 7 dilution units per milligram of 
protein. meg-CSF has also displayed activity in an assay 
using human cells, as described in Example 8 below. 

Active meg-CSF has an apparent molecular 
weight of approximately 2 8-38 kd as determined by sodium 
dodecyl sulfate polyacryl amide gel electrophoresis under 
non-reducing conditions. meg-CSF has an apparent 
molecular weight of approximately 20-27 kd as determined 
by sodium dodecyl sulfate polyacrylamide gel electro- 
phoresis under reducing conditions. 

The active approximately 28-38 kd meg-CSF is 
further characterized by comprising all or a portion of 
the sequence of Tables I through IV. Meg-CSF is also 
characterized by comprising at least one of the same or 
substantially the same four amino acid sequences or 
fragments thereof, recited below as sequences (a) through 

Another aspect of the present invention is a 
DNA sequence that encodes the expression of a human meg- 
CSF protein. This DNA sequence may include an isolated 
DNA sequencie that encodes the expression of a human meg- 
CSF protein as described above. The DNA sequence coding 
for all or a portion of the meg-CSF protein is 
characterized as comprising the same or substantially the 
same nucleotide sequence in Table I, II , or III or 
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fragments thereof. This DNA sequence may include 
additional coding sequence. The DNA sequence may also 
include 5 1 and 3 1 human non-coding sequences flanking the 
meg-CSF coding sequence. The DNA sequence may also 
encode an amino terminal signal peptide. Table I 
illustrates a putative partial genomic sequence beginning 
at the extreme 5 1 border of the genomic clone and ending 
at the Bgl II site in the intron occurring between Exons 
II and III. Table I contains sequence encoding the open 
reading frames of Exon I and Exon II. it may also 
contain the N- terminal Exon and any other additional 
Exons which are contained in the genomic sequence 5 1 of 
Exon I which are present in the full length Meg cDNA. 
Table II illustrates the other known Exon III forming a 
partial genomic coding sequence of human meg-CSF isolated 
from human urine and expressed in COS-1 cells. Table III 
illustrates a putative partial CDNA coding sequence 
containing Exons I through III derived from the genomic 
sequence. 

It is understood that the DNA sequence of this 
invention may encodes a biologically active human meg-CSF 
protein and may also comprise DNA sequences capable of 
hybridising under appropriate conditions, -Sr which would 
be capable of hybridizing under said conditions, but for 
the degeneracy of the genetic code, to an isolated DNA 
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sequence of Table I, II , or III. Thus, the DNA sequence 
of this .Invention ^ay include or contain modifications in 
the non-coding sequences, signal sequences or coding 
sequences based on allelic variation, species variation 
5 or deliberate modification. 

Still a further . aspect of the present 
invention is a process for isolating and purifying the 
meg-CSF composition of the present invention or a 
fragment thereof, from human urine. This purification 

10 process provided by the present invention involves the 

steps of concentrating the urine; subjecting it to anion 
exchange column chromatography; followed by cation 
exchange column chromatography; subjecting the resulting 
materials to lectin affinity chromatography followed by 

15 . cation exchange fine performance liquid chromatography 
(FPLC) and three elut ions through reverse phase high 
pressure liquid chromatography (HPLC) using different 
solvent solutions for each HPLC run. 

A further aspect of the present invention is 

20 homogeneous meg-CSF, purified from urine or produced via 
recombinant or synthetic techniques, which is 
characterized by a specific activity in the murine fibrin 
clot assay" of greater than 5X10 7 dilution units/mg. 
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Also provided by the present invention is a 
recombinant DNA molecule comprising vector DNA and an DNA 
sequence encoding human meg-CSF. The DNA molecule 
provides the meg-CSF DNA in operative association with a 
regulatory sequence capable of directing the replication 
and expression of meg-CSF in a selected host cell. Host 
cells transformed with such DNA molecules for use in 
expressing recombinant meg-CSF protein are also provided 
by the present invention. 

The DNA molecules and transformed cells of the 
invention are employed in another aspect, a novel process 
for producing recombinant human meg-CSF protein, or 
peptide fragments thereof. In this process a cell line 
transformed with a DNA sequence encoding expression of 
meg-CSF protein or a fragment thereof (or a recombinant 
DNA molecule as described above) in operative association 
with a suitable regulatory or expression control sequence 
capable of controlling expression of the protein is 
cultured under appropriate conditions permitting 
expression of the recombinant DNA. The expressed meg-CSF 
protein is then harvested from the host cell, cell lysate 
or culture medium by suitable conventional means. The 
condit±uned JaSdium may be processed through the same - " 
purification steps or modifications thereof as used to 
isolate the meg-CSF from urine. This claimed process may 
employ a number of known cells as host cells for 
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expression of the protein. Presently preferred cell 
lines for producing meg-CSF are mammalian cell lines and 
bacterial ceils. 

As still a further aspect of the present 
5 invention, there is provided recombinant meg-CSF 

protein. This protein is substantially free from other 
human proteinaceous materials and comprising a DNA. 
sequence encoding one or more of the peptide fragments or 
sequences described herein. The meg-CSF protein of this 

10 invention is also characterized by containing one or more 

of the physical, biochemical, pharmacological or 
biological activities described herein. 

Another aspect of this invention provides 
pharmaceutical compositions containing a therapeutically 

15 effective amount of homogeneous or recombinant meg-CSF 

or an effective amount of one or more active peptide 
fragments thereof. These pharmaceutical compositions 
may be employed in methods for treating disease states 
or disorders characterized by a deficiency of platelets. 

20 Thus the meg-CSF composition of the present invention 

or pharmaceutical^ effective fragments thereof may be 
employed in the treatment of a plastic anemias resulting 
from chemotherapy or thrombocytopenia. meg-CSF may be 
used as an adjunctive therapy for bone marrow transplant 

25 patients. 
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A further aspect of the invention, therefore, 
is a method for treating these and other pathological 
states resulting from a deficiency of platelets by 
administering to a patient a therapeutically effective 
amount of meg-CSF or one or more peptide fragments 
thereof in a suitable pharmaceutical carrier. These 
therapeutic methods may include administering 
simultaneously or* sequentially with meg-CSF or one or 
more peptide fragments thereof an effective amount of at 
least one other TPO-like factor, a cytokine, 
hematopoietic interleukin, growth factor, or antibody. 

Still another aspect of the present invention 
are antibodies directed against human meg-CSF or a 
fragment thereof. As part of this aspect, therefore, the 
invention claims cell lines capable of secreting such 
antibodies and methods for their production and use in 
diagnostic or therapeutic procedures. 

Other aspects and advantages of the present 
invention will be apparent upon consideration of the 
following detailed description of preferred embodiments 
thereof. 

— Brief De scription of the Prawing a — — - 

Fig. 1 illustrates a restriction map 
illustrating the position of restriction endonuclease 
enzymes of the 18.2 kb genomic clone containing meg-CSF. 
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The lower portion of the Figure illustrates the genomic 
regions in which Exons I, II and III are located, and 
shows their relative positions in the restriction map. 

Detailed Description of the Invention 
5 The novel human megakaryocyte colony 

stimulating factor, meg-CSF, provided by the present 
invention is a homogeneous protein or proteinaceous 
composition substantially free of association with other 
human proteinaceous materials. This protein can be 

10 produced via recombinant techniques to enable large 

quantity production of pure, active meg-CSF useful for 
therapeutic applications. Alternatively this protein may 
be obtained as a homogeneous protein purified from human 
urine or a mammalian cell line secreting or expressing 

15 it. Further meg-CSF or active fragments thereof may be 
chemically synthesized. 

meg-CSF of the present invention is 
characterized by one or more of the following 
biocheiiiical and biological properties: 

20 (1) The composition of the present invention 

has an apparent molecular weight of approximately 28- 
38 kd as determined by 12% sodium dodecyl sulfate 
polyacrylamide gel electrophoresis (SDS-PAGE) under non- 
reducing conditions and by murine fibrin clot 

25 megakaryocyte colony formation bioassay; 
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(2) The composition of the present invention 
has an apparent molecular weight of approximately 20-27 
kd as determined by 12% SDS-PAGE under reducing 
conditions using a variety of reducing agents, e.g., 
beta-mercaptoethanol or dithiothreitol ; 

(3) The composition of the present invention 
has a specific activity in the murine fibrin clot 
megakaryocyte colony formation assay of greater than 
approximately 5X10 7 dilution units/mg protein. 

(4) The meg-csF composition of the present 
invention contains one or more of the same or 
substantially the same amino acid sequences or fragments 
thereof: (a) Ser Arg Cys Phe Glu Ser Phe Glu Arg 

(b) Arg Val Cys Thr Ala Glu Leu Ser Cys Lys 
Gly (Arg) 

(c) Lys Ala Pro Pro Pro (X) Gly Ala Ser Gin 
Thr lie Lys 

(d) Lys Tyr Asp Lys Cys Cys Pro Asp Tyr Glu 
Ser Phe Cys Ala Glu Val His Asn Pro 

(e) an amino acid sequence contained in Table 
I, IA, IB, ic, ii, or III below. 

These sequences or fragments of these sequences 
may also -have biological -5Sr physiological aetivity 
similar to that of the complete meg-CSF protein. In the 
sequences, (X) indicates that the residue is not yet 
absolutely identified, but may be Ser or Thr; and 'J 
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indicates tentative identification of a residue. The 
sequences identified above as (a) through (d) were 
originally determined from purified material from step 8, 
the third HPLC purification step of the purification 
5 procedure, omitting step 7. The same sequences also have 
been obtained from the material, when purified through 
all eight steps. The DNA and amino acid sequences of 
Tables I throt&gh III are discussed in detail below. 

(5) The meg-CSF composition of the present 
10 invention is capable of binding SP-Zeta Prep under 

acidic conditions of pH 4.5. 

(6) The meg-CSF composition of the present 
invention is capable of binding to Wheat Germ-Sepharose 
and Concanavalin-A Sepharose. 

15 (7) The meg-CSF composition of the present 

invention elutes between 23-33% acetonitrile on a 
reverse-phase C4 HPLC column using a solvent system of 
trifluoroacetic acid (TFA) and acetonitrile. 

(8) The meg-CSF composition of the present 
20 invention elutes between 6-15% n-propanol on a reverse- 
phase C18 HPLC column using a solvent system of 
pyridine, acetic acid and n-propanol. 

(9) The meg-CSF composition of the present 
invention elutes between 27-37% acetonitrile on a 

25 reverse-phase C4 HPLC column using a solvent system of 
heptafluorobutyric acid (HFBA) and acetonitrile. 
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The biological activity of the meg-CSF 
composition of the present invention is demonstrated by 
its ability to stimulate the growth and development of 
colonies consisting of intermediate and large size 
megakaryocytes in culture. In the murine fibrin clot 
megakaryocyte colony formation assay, the meg-CSF 
composition of the present invention stimulates 
colonies of an average of 3-6 megakaryocytes. In the 
murine agar meg-CSF assay, the meg-CSF composition of 
the present invention stimulates colonies of 
megakaryocytes. The meg-CSF composition of the present 
invention has inconsistently shown activity in the human 
plasma clot megakaryocyte colony formation assay. 

meg-CSF was originally detected in the urine 
of human patients with bone marrow transplants. These 
patients demonstrate an enhanced level of meg-CSF 
activity. Human meg-CSF was initially purified from this 
human urine by a sequence of purification steps and 
techniques specifically described in Example 1 below. 
However, this factor may also be purified from other 
sources, e.g., human cell lines, or produced via 
recombinant means from those cell lines. 

The. purification techniques -employed in — " 

obtaining meg-CSF from the human urine comprises the 
following steps. The purification steps include 
concentrating pooled bone marrow transplant patient urine 
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through an Amicon YM-10 filter. The concentrated urine 
is passed through an anion exchange chromatographic 
column and the f low^-through is bound onto a cation 
exchange chromatographic column. The urinary protein 
5 eluate was then subjected to pooling, dialyzing and 

heating and applying it to a lectin affinity 
chromatographic column. This eluate is then dialyzed and 
applied to a cation exchange FPLC column. Finally this 
eluate is applied through three cycles of reverse phase 

10 HPLC using different solvent systems. 

Batches with the highest levels of meg-CSF in 
the murine fibrin clot assay, described below, were 
selected for further purification at the semi-preparative 
scale (between 3 0 and 100 liters urine equivalent) to 

15 maximize recovery and yield. 

Thus the homogeneous meg-CSF may be obtained 
by applying the ^above purification procedures, which are 
described in detail in Example 1, to human urine or 
other sources of human meg-CSF, e.g., activated 

20 peripheral blood leukocytes and human placenta. Other 

tissue sources and cell lines such as C10-MJ2 (an HTLV1- 
transformed T cell line) and HEK (primary human embryonic 
kidney cells) may also be sources of this protein. ^ 
Procedures for culturing a cell source which may be found 

25 to produce meg-CSF are known, to those of skill in the 

art. 
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meg-CSF or one or more peptide fragments 
thereof may also be produced via recombinant techniques. 
To obtain the genomic DNA and CDNA sequences for meg-CSF 
or one or more fragments thereof, tryptic digests of the 
purified, sequenced polypeptide were prepared, i.e. the 
tryptics identified as (a) through (d) above, by 
conventional techniques. 

As described in detail in Example 5, below 
oligonucleotide probes were synthesized using the 
genetic code to predict all possible sequences that 
encode the amino acid sequences of the tryptic fragments 
or the above-identified, amino terminal sequence of meg- 
CSF. The probes were employed to screen a human 
placenta lambda phage DNA library (a human genomic 
library). One of the probes hybridized to an 18.2 kb 
genomic DNA insert. A restriction map of this insert is 
illustrated in the top portion of Fig. I. The lower 
portion of the Figure indicates the 5 kb region of the 
restriction map containing genomic sequences encoding 
three putative Exons I, n and III. The genomic 
sequence was obtained in three sections; NotI to EcoRI, 
EcoRi to Bglii and Bglll to Bglll. The sequences were 
connected together at-^tirese sitesTTo generate one long — 
continuous stretch of genomic sequence reported in Table 
I. However it is possible that at these junctions there 
might be a small deletion which is not present in Table 
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The sequence of Table I is a partial sequence 
of the genomic Hieg-CSF clone beginning at the extreme 5' 
border of the clone (adjacent to the 5 1 Not I site of 
Fig. I) . Nucleotides indicated by -1 to -64 are 
5 polylinker nucleotides. The first nucleotide of the 
genomic sequence is indicated as nl in the sequence. 
The Table I sequence ends at the second Bglll site in 
the intron sequence between Exon II and Exon III. This 
sequence contains approximately 7505 nucleotides. Four 

10 enzymes, Kpnl, PvuII, EcoRI, and SnaBI have the ability 
to cut this sequence and NotI present in the polylinker 
of the cloning site, adjacent to the 5' border of the 
genomic sequence can also cut this sequence. Their 
expected cutting sites are indicated- in Table I at 

15 approximately the sites marked by asterisks. It is 

postulated that this sequence contains at least two 
Exons but the location of the 5 1 border of Exon I and 
the presence of any additional Exons has not been 
positively determined. The amino acid sequence of Exon 

20 I is not positively determined. Further the proper 
reading frame of the amino acid sequence of any 
additional Exons contained in this sequence is not 
positively determined. IrTthe absence of positive 
identification of the proper reading frame, Tables IA, 

25 IB, and IC contain putative amino acid sequences for the 
reading frames beginning with the first genomic 
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nucleotide of the genomic clone Table I (reading frame 

A) , with the second nucleotide of Table I (reading frame 

B) and the third nucleotide of Table I (reading frame 

C) , respectively. All three reading frames contain a 
potential N-terrainal Met-containing open reading frame 
sequence identified by computer as a likely Exon 
candidate containing a secretory leader sequence. One 
such sequence spans nucleotide #489 7 through 5073 
(underlined) in Table IA. The amino acid sequence from 
nucleotide #5742 to #5961 of Table I (in Reading Frame 
A) has been determined to correspond to the coding 
region of Exon I. Therefore, Table IA includes Exon I 
(underlined) , and may be the correct reading frame 
sequence, assuming that no nucleotides were deleted or 
repeated incorrectly in the sequence. The amino acids 
in Tables IB and IC corresponding to nucleotides #5742 
to #5961 have been deleted. 

Exon II spans nucleotides #7 340 through #7459 
in Reading Frame B. Other potential open reading frame 
sequences -include in Reading Frame B, a sequence 
spanning nucleotide #4202 through #4465 and a sequence 
spanning nucleotide #4466 through #4915. In Reading 
JEraiire~C, cm such potential^sequence spans nucleotide" ~" 
#2559 to #2732. 

The sequences of Table I through III were 
obtained from partial genomic clones of human meg-CSF 
and contain Exons I, II and III obtained as described 
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below. The region which hybridized to the probe was 
subcloned, sequenced and is illustrated in Table I, with 
the open reading frame labeled Exon II which is found on 
the Bglll fragment appearing directly above the bar 
5 graph labelled II in Fig. I. 

The 18.2 kb genomic DNA fragment also 
hybridized to two additional probes. The hybridizing 
regions were individually subcloned, sequenced and shown 
to contain the sequences illustrated in Table I and II, 

10 respectively, as Exons I and III within open reading 

frames. All four tryptic sequences are present in the 
three Exons. 

Two of the tryptics overlapped intron/Exon 
junctions and define the borders of Exon II. The 

15 resulting putative cDNA sequence and predicted amino 

acid sequence (three letter code) of three Exons (I, II, 
III) in a single reading frame containing the meg-CSF 
partial cDNA sequence are reported in Table III below. 
The partial sequence contains 182 amino acids and 546 

20 nucleotides,, containing all four of the tryptic 

sequences above. The 5 1 and 3 r borders of this cDNA are 
not precisely identified, indicating that an N-terminal 
Met-containing Exon is presently unidentified, as well ~ 
as a possible extra Exon 3* to Exon III. It is ? 

25 presently speculated that a possible site for the 5' 
border of Exon I (i.e., where the N-terminal Met- 
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containing Exon or the 5 '-adjacent Exon would splice) 
occurs in the underlined sequences located in Tables I. 
IB, or IC above (other than the underlined sequence of 
Exons I and II) . 
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TABLE I 

Partial genomic clone 5' NotI to 2nd Bgl II site, 
containing Exon I 

NOtI 

5 * 

AATTCGAGCT CCACCGCGGT GGCGGCCGCG AGCTCTAATA -64 

Sal I nl 

CGACTCACTA TAGGGCGTCG ACTCGATCTT TTTACTCTGA 16 
10 ' Nco I 

AGGACTTTCT CTACTCCTTT AGACCATGGG CAGAAATGTA 56 
SnaBI 

15 CACATTATTG GTCTACGTAG ACAGACAAAT TTGTAATCTC 96 

TGAACTATAA TTTCAAATTT CCAGGAGAAG AAAACATATT 136 

PVUI 

GGCTCGGGTT GTTCAAGTTC CAATTCCTAA TCCTATCAGC 176 

20 TGTGGCTGAT GTGGGAAGAT AACATATATA' AC C AGAGCTA 216 

AAAGGAAAAT AGCCCGATAG AAAGAGAAGT AGTTCCCAGA 256 

AAAGGGGGAT TATTTAGAGA TGAGCAGATA GCCTCGAAAT 296 

TGTCTTTACA TATTTAGTCC TGTGAAATAA GCACTGTAGA 336 

AAGAAAATAG ATGCTTATAA TTCTAGTCTT AAAAAAGTCA 376 

25 CAATCTCACA AACCTATGTA TACATGTAGA GTAAAGGAGT 416 

AAAATTTAGT ATAATGATTA CAAATTAAGT GATGGTTTTG 456 

ATCAAAAGGG AGCTGGATGT AGTGGAAACA 1GACGGAACT 496 

TAAAATGCAA AGACATGGGT TCAAATCTTG GTTCTGGCCT 536 

CTAGTCTTTA TAGATCCTTG GATGGGTCAC ACATTCTTCA 576 

30 TTTGTGGGTC TTCATCTTCT TATTTGTAAA TGTGGATGAT 616 
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TATATTTAAC 


TTTAAGACAT 


TTTGTGAGAA 


TTAAATGCAG 


656 


TACCTACCAA 


ATGAGTAGTA 


GACAGCAGAA 


TCCAGCCTAC 


696 


AACCATGTGG 


TTCACTGAAC ATGTTTTCTT TTTTTTCTTT 


736 


TCTTTTTTTT 


TTTTTGAGAC 


TGAGTTTTGC 


TCTTGTTGCC 


776 


CAAGCTGGTG 


TGCGGTGGCG 


TGATCTCAGC 


TCACTGCAAC 


816 


CTCTGCCTCT 


CAGGTTCAAG 


TGATTCTCCT 


GCCTTAGCCT 


856 


CCCGAGTAAC 


TGGAATTACA 


GGCGCCTGCC 


ACCACACCCG 


896 


GCTAATTTTT 


TGTATTTTTA 


GTAGAGAGGG 


AGTTTCACCA 


936 


TGTTGGTCAG 


GCTGGTCTCA 


AACTCCTGAC 


CTCAGGTGAT 


976 


CTGCCCACCT 


CAGCCTCCCA 


AAGTGCTGGG 


ATTGCAGGGG 


1016 


TGAGCCACTG 


CACCCGGCGT 


GAGCATGTTT 


TAAACATATA 


1056 


ACATTGGTGA 


AACAACTGGG 


CTGTATGTTG 


GCAATAGACT 


1096 


AACATATAGG 


TGTTGGTTGT 


CCTTAGGGAA ATCATAGTCT 


1136 


GTTGGGGGTG 


TGAGACAGAT 


GAAGAGATAT 


TTTAGTAAAA 


1176 


CGTAGTACAA 


TTGTTTACTA TAATTCTAGA TGTGATTGGA 


1216 


ATTTGATCTG 


GAAAGGTTTC 


AGGGAGGTGC 


TAGGACATGA 


1256 


GCGAAGCCTT 


ATGTTGGTGG 


GTAAGATTTT 


ACACCGATAC 


1296 


ATTCCGTATG 


AAAGTGGGGT 


AAAGCCTTGG 


GCTGAGCTGA 


1336 


CTTAGCTCTG 


CAATGGTGAT 


GGTTCACATT 


TTCATTGTGG 


1376 


GAAGGAAACA 


AAATTAGTAA 


TTTAAAAGAC 


CTTTTCCAAA 


1416 


CATGCTTATC 


TTAAAGGAAG 


GATCTAAGGA AAATATCAAT 


1456 


ACATTTTTTA AGCAGATTTT 


TAAAATAATA 


AAGGAAGCTA 


1496 


GAAAATCCHG 


TGTGAATAAT CTA6CTGTAG GTAACTGAGA 


1536 


GCATGTTTTC 


TGTGCTCAGA 


GAAAATGAAA 


CTACTCAAGA 


1576 
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5 



AATGTATATT 


TGATATAATG 


. GTATAAGTAT 


ATAAAATGTA 


1616 


TGCCACAAGC 


TAACACTAGG 


TAAATTTCAA 


AGCTACCTTT 


1656 


GCTAGATTTA 


ACTGGAAAGC 


AGAAGAAATA 


AATGATACCA 


1696 


GTATGTGAAT 


ATTTTAAAGT 


TGCTGCTTCA 


ATATACTAGT 


1736 


CAAAGTCTGC 


CAGAGCTATT 


TACCATCTCA 


GGACAATTTG 


1776 


TTCACTTCAC 


TTAGAAATTA 


GAAATGTGCT 


CTGGCCAATA 


1816 


CATCTAAATT 


TATTTTGTGG 


TAGTCATTTT 


GCTTAATGGA 


1856 


AGTGTTTTCC 


AGATGTTGCC 


TAAGTCTAGT 


CGTCTGGGCC 


1896 


CTTTTCCAAT 


AAATGAGGAA 


AGTTTGATTT 


CATAGGTTGT 


1936 


CACTGTTGAT 


TTTGTCTAAC 


CTTTGGACTA 


ATTGGTTCAT 


1976 


CCCACTGTAT 


TTGCACTGAT 


ATATAAGACT 


CCCAGGACGT 


2016 


GGGATAAACT 


CATCTATCCT 


TTACGGGTAA 


GTGTCAAAGT 


2056 


TAACTTGCTT 


CATAGAATTA 


AATGTGTTTT 


CATTAGAGGT 


2096 


GTTTGAAAAA 


AAATGTGTAA 


AGATAGTAGT 


TGGAAATTTT 


2136 


TGAAAAGGAT 


TXTGTTTATG 


CAATACAGCT 


GTTGGAAGCC 


2176 


TTTGAATTTA 


TATTGAGAAT 


TAAGAAAAAG 


TTGGAACAGT 


2216 


AACTCCATGA 


TGCTTATTAA 


ATTACATTTC 


TGTGACACAG 


2256 


GTTATTATTT 


TCCTTAAGAT 


AAAATTTTAA 


CCTTGCACTG 


2296 


TTAAGTACAT 


GQCATACTTT 


GGCTAGAGTT 


TTAAGATAAG 


2336 


TCTATTCTAT 


AT31QGAGATT 


TCTTAAGAAT 


TAAAACTGTC 


2376 


AATAACGTGG 


CTGAGGGTGA 


CTTGATTTGT 


TCTGCATAAG 


2416 


ATTAAGTCTA 


ATGGCCAAAT 


ATTTTCTATA 


AAATTAAAGC 


2456 


TGATTTAAAG 


TCTTGTTTCA 


AGAATGGGAT 


GGTTTTATTA 


2496 


TTGTTACATT 


TTTAAAGTTA 


CTGAAATGTG 


TATAATGCAA 


2536 
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GCCTAAGTTA GTGGTGAGAT G AAA GAG CTG TTTTCTGATA 2576 

Kpnl Kpnl 

CTTTTATTTT ATTTTCAGCA AGGGTACCTA CGGTACCTGA 2616 

5 AAACAACGAT GGCATGGAAA ACACTTCCCA TTTACCTGTT 2656 

GTTGCTGCTG TCTGTTTTCG TGATTCAGCA AGTTTCATCT 2696 

CAAGGTAGCT TAACCATCGA ACATACTTTT ATTTAACAAC 2736 

TATTGCTAAT CATTCAGTCT TGATTTTTAT AACAACGGAA 2776 

10 ATATATTTCT AAAAATTTAT ATTTGCTTGA GTTTAATATA 2816 

CTGTACAACC TTAAAATAAT ATATAACATG TAGCCTGTTT 2856 

Eco RI 
* 

GTAAGTGCTT TGGAATTCTC ATCTTTTTCT CCTTTTGAGA 2896 

15 GTTTTGAGCT TGTATAAAGA ATAATTTTGG TACTTGTTAT 293 6 

TTCTATGGTT CTCTCGAAAG TTTAAAAAGT GCAGACAGTT 2976 

CATCATAAAA TTATGCTGGA GCTATATAGT ATCATAAAAA 3016 

TAACTAAAAA AGAACAATAT TTAGATAACA TGTAATTTTT 3056 

GCCTATATGT TGAACAATTT GTTTTTTAAA ATCAAATAAA 3096 

20 TTATTGATAA ATAACTTATT TTGGCATCAT TTAGACATAG 3136 

ATACCATATT ATAATAAACT ATAGTGTTTC AAATGGCATT 3176 

TTATTAGTCT TGAAAATGTT CTCTTTTTCT TACAAGAACT 3216 

ATTTATCCAG GTATGGACAT ATACAGATGT GACTGTCATT 3256 

TTGTGTTATA ACATTAAACT TTTCACAGTT CTCCCAAAAC 3296 

25 AGGTCCTGGA TGAATAGTTC TTGTTACTCA TTTTTAGAGA 3 336 
- CTGTTAAAGT ACATTCAGTG ftAATCACATT AGTAAAJkGAT - ~3 376 

GCTAGCATGC CATCTAGGGG GATTAATAGT AACAATGCCA 3416 

AGCTTTGGAT TTTTCTTTTC TTTTCTTTTC CTTTTGTTCC 3456 
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CCTTTGATAA 


GCAAAATCTG AGAGAATAAA 


AATCAAGATT 


3496 


CATGACAGTT 


ATGATGAAAT TATGTTTCTA 


AAGTAAACAT 


3536 


TTCCAGTAAA 


ATACGAGATT CTTATGAAAC 


CTGAACATGA 


3576 


GTGGTAACTG 


TCTGCATAGG CATAAGTTGC 


AGAATTGTTT 


3616 


AGATAAGAAA 


AGACAGGAAA ACACATGAAA 


GCAAATGTGA 


3656 


ATATTCAATA 


AGAATGATGA CTACTCCAGT 


ATCTCCAGAC 


3696 


CCTTCGGCTT 


TCTCGTAACA CTATGACAAG 


GTTCACAACA 


3736 


CTGGGGGCAC 


TTTCTAAACT GCCTTTTCCT 


CTGTGATACA 


3776 


ATTGGTTGTT 


CACTAAAACA GTGTTACTTT 


CATTTTAATT 


3816 


GTGATTAAAT 


AAATCAAATT AAAATTAATG 


GGGCTGGGTG 


3856 


TGGTGGCTCA 


CACCTGTAAT CCCAGCATTC 


TGGGAGGCCT 


3896 


AGGGAGACAG 


ATCACCTGAG GTCTGGAGTT 


CAAGAGCAGC 


3936 


CTGGGCAACA 


TGGCAAAATC TTGTCTCTAT 


ACAAAATACA 


3976 


AAAAAATTAG 


CCAGGTGTGG TAGTGTGTGC 


CTGTAATCCC 


4016 


AGCTATTCGG 


AGGCTGAGGC AAGAGGATAT 


CTTGAATCCA 


4056 


GGAGGTGGAG 


GTTGCATTAA GCTGAGCTTG 


CATCATGGCA 


4096 


CTCCAGCCTG 


GGCAACAGAG TGAGACTCTG 


TCTCAAAAAT 


4136 


AAATAAATAA 


ATAAATAAAA TTAATGGTTA 


CAATTAATAG 


4176 


CAATAAAAGT 


AGGACAGACA CCTAATCTAT 


GAAAGTAAGC 


4216 


TTTTCCTGTA 


AGGATGATTT CCTCTTTTTT 


TTTTTTCTTT 


4256 


TGAGACAGGG 


TCTTGCTTTG TCTCCCAGTC 


TGGAGTGCAG 


4296 


TGGCACAATC 


TTGGCTCACT GCAACCTCCA 


CCTACGGGGT 


4336 


TCAAGCGATT 


CTCCTGTCTC AGCCTCCCGA 


GTAGCTGGGA ~~ 


4376 


TTACAGGCCC 


QTGCCACGAT GCCCGGTTAA 


TTTTTGTATT 


4416 
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* 


TTTAGTAAAG 


ACAGGTTTTA 


ACCATGTTGG 


CCAGGCTGGT 


4456 




CTCACCTGAC 


CTCAGGTGAT 


CCGCCCGCCT 


CAGCCTCCCA 


4496 




AAGTGCTGGA 


ATTACAGATG 


ATTTCTTATT 


TCAGAAATCT 


4536 




GCCAACTATA 


AAAGAGCAAT 


CTCTTGATAC 


TGTCTTGTCT 


4576 


5 


GCTTCTCTTG 


CTTTCTCAAC 


CTCTTCTCAT 


TCTCTTTTTT 


4616 




CTTTATATAT 


AATATATGTA 


TTTATATACA 


TACACTATAT 


4656 




ATACATTTTT 


GTATGCATTA 


TGCACTCATG 


TACGCAAAAA 


4696 




GTTCTGAAAG 


TTGTCCTACA 


ATTTACTGTT 


TTATTTGCAT 


4736 




ATTCAGACTT 


TGGCATTCCT 


GGACTCTATT 


CTTTTAAGAT 


4776 


10 


TTGTTTTCAG 


TGTGTTTCAA 


CATTCCTTTG 


TGGATTTAGG 


4816 




ACAGTACACC 


TGCCAATTTC 


TATTCCAGGG 


ATGGATTCCA 


4856 




TTGTCACATT 


TCTGCAGTCA 


TTTCTCAGGG 


AGGGTTTTAA 


4896 




GGTGGTGTTT 


TCCAAATGAC 


TTTTAAAAAA 


TATTTGAGAA 


4936 




TGAGTATGCT 


TTTGTTATTG 


TTTGCTTATT 


TGTTGCCTGG 


4976 


15 


AAAACCTCTG 


TTCATGACCT 


TTATAGTTAT 


GCCCAGTTAT 


5016 




TTTAAGGGTT 


CTCACTTCAG 


CAGTGGTTGT 


AAGCATCTGC 


5056 




CTCCCCTGAC 


ATTTTAAAAA 


TCCAGTTATC 


CCCACCAATT 


5096 




AAGGAAGAAA 


AAGCTCAGTA 


AAGTTTATGC 


TGGTGCAACC 


5136 




AACGTGCTTT 


AAACTTATCC 


TCAGGAATGG 


GAGGCAAAGC 


5176 


20 


GACAGGTGGA 


AACATCTCAG 


GCTTAAAATA 


AGATATATCA 


5216 




GAGTTCAAAT 


TCTGGATGGA 


TTGTTTACTT 


AAGGTGACCA 


5256 


< 


TAAAATGTAT TATCTAAATT 


AGGACAATTA 


TAAGGGTAAA 


5296 




AGAGTTCACT -ACAAA*FAAl i T~ 


GGGAG0TATT 


AATTAACCTA — 


5336 




TGATGCCAAT 


ATACTGGAAC 


TATTAACTAG 


TCGTGTGATA 


5376 
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ATGAGCAAGT TGTCAAAATT TCTGTGTTTC AGCTTTCTTA 5416 

GCTCTAAAAA AGGGCTACTA AATCCTGGAA ACATTTTCAT 5456 

AAGCATTAGT GATAATGTAT GACTCAGCAC CTGGCATAGA 5496 

GAAGCTCAAT AAATGGCAAC TGCTAATCAT CAAAACCACA 5536 

5 GATCGGTAGT AGCTGTAGCT GCCACCTCTT CCTTAGGAGT 5576 

ATCACCCTCT TTAGGCAAAG CCATTACTTT GCCTGGTTTT 5616 

CAAAAATGTG TTTATCATCT CAGTCTAGTG AAGAATAAAG 5656 

TGACATAATT GAGATCACCT AAGACATCAG CCAAATATCA 5696 

PvuII 
10 * 

GCTGGGCCTA TTGCTGACAT CATTCCAACA CCTTCTCGAT 5736 



CAATAA AAT TCT CTC TCA CCA AGT GGC TTT GTC CCC CTC 5775 
Asn Ser Leu Ser Pro Ser Gly Phe Val Pro Leu 





GTT 


AGA 


TTG CTC 


CCT 


TTC 


TAT 


AAA 


GTG 


GTT 


TGG CCA 


TAT 


5814 


15 


Val 


Arg 


LSu Leu 


Pro 


Phe 


Tyr 


Lys 


Val 


Val 


Trp Pro 


Tyr 






TTA 


CGC 


CAG TAT 


TGT 


ATA 


ATT 


TTA 


GAT 


TTA 


TCA AGC 


TGT 


5853 




Leu 


Arg 


Gin Tyr 


Cys 


He 


He 


Leu 


Asp 


Leu 


Ser Ser 


Cys 






GCA 


GGG 


AGA TGT 


GGG 


GAA 


GGG 


TAT 


TCT 


AGA 


GAT GCC 


ACC 


5892 




Ala 


Gly 


Arg Cys 


Gly 


Glu 


Gly 


Tyr 


Ser 


Arg 


Asp Ala 


Thr 




20 


TGC 


AAC 


tgt.Jgat 


TAT 


AAC 


TGT 


CAA 


CAC 


TAC 


ATG GAG 


TGC 


5931 




Cys 


Asn 


Cys Asp 


Tyr 


Asn 


Cys 


Gin 


His 


Tyr 


Met Glu 


Cys 






TGC 


CCT 


GAT TTC 


AAG 


AGA 


GTC 


TGC 


ACT 


GCG 


GGTAAGTCCT 


5971 




Cys 


Pro 


Asp Phe 


Lys 


Arg 


Val 


Cys 


Thr 


Ala 









GAGAGCGGGT 


GTCTCCTCTG 


TCAAGCAACA 


CTGCGAGTCT 


6011 


GTGAGTCCCC 


eCTTGCACCC 


TCGTGCAATG 


CTGTGAGACT 


6051 


GAGCCTCCCC 


TTGCACCCAC 


TTGCAGTGCT 


GTTTTCCCAC 


6091 


TGTATCACTT 


"CCTTTGCTTA 


AGTGACTTTC 


CTTCACTTGC 


6131 


ATAAATGTTG 


CAGCGCATTG 


TACCTAAGGA 


TGGTGTCAAA 


6171 


GCTTGTGATT 


ATTGGGGAGG 


GTAAGGGGAG 


CCTGGAAGTC 


6211 
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TGTCACTTGC ACAAGGTTAT TGTCTAGTTC CAAGTAAATG 6251 

TCTTATTTTA AATAGAGAAG ATACTGTGAG TCAGGAGTCA 6291 

TTAAGTAAAT AGTAAGGAGT AATTATTTAC AATCAGAGAT 6331 

GTCCTCGACT AGTAATGAGG ACTTTCAATA GTTTCATCAA 6371 

5 AAGAATACAT GAGCATGGAA TAATCTTGTC TACCATTGGA 6411 

CTTTCTGATG TTAACACGTT TGTAAATTAC CTGGAAAAGA 6451 

CACTTTAAAG ATAAAGTGCT AGTGTTTAAT GATGAATTTC 6491 

AGTTAAAATC TGAAGGTCTG AATTTTAAAT TATGAGTGGT 6531 

GTTTCACTTG AATATATTTT GGGCCATTTT ATTATTAACA 6571 

10 CAGAAACATT TCCTATTCAA TTTAAGAAAA TTGTAATTAA 6611 

AGTTATTTGA AAGGTGTATT TGCTTTAGAA TTTGAATGTG 6651 

AAGTTGAGGA ATATATTGTG CAATTTATGT ATTGGATAGG 6691 

TAATTATTTC AAATAAGCCT TGGTAAGTCC CTGTTAACTC 6731 

TAACAAAGGC TTTTTAAATT TTCATTTTTT AAATGAGTAG 6771 

15 TCCTCCCTAA CATAGTCCAC ACTGTAAGAT TAGGCTGAAA 6811 

GCTTTCAACT ATACACCTTC CCCAGTTTGT CATAATAAGG 6851 

GCCTCTGCAT TACAAATGAT TTTAAGCCTC AAAAATGACC 6891 

CATTTACGTG GAATATATAT ACATATATAT TTATATGGAA 6931 
Bglll 

20 * 

TATATTGCTG AGATCTGTAC ATTCAATTAC TGTGAATCTA 6971 

TTACAAAGCA GTGTGTGAAG AGGAGAAGGA TGAAGAGATT 7011 

TCATATGAAG GCTATCTCAC TATCTAGACA TTTCCCGATT 7051 

TTTCTTTGTC CATACATGTA AATAACTCGG GCAGeATCAG 70^: ~ 

25 GATGTCTCTT GGAGTCTGGA AGGGCAAGAG GAGTTGCCCT 7131 

CAGTCACCAT ATTTCTTTTT TGACTTGGGC TGTCTCCATC 7171 
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TGGGATACCA TCTAATTTTT CCTGGATGAT GTACTCCAAA 7211 

TTTCAAATAA AAGACTTAGA AATGAACTTT TGGAAACCTA 7251 

GTCAAGTCTA SGGTGGGAAA TGGCTGTCAA ATACGTGGGC 7291 

CTGGCTTCAC AATGAATAAT CTGTAACTTC TTGTTTTGCT 7331 



5 CTGGGTA GAG CTT TCC TGT AAA GGC CGC TGC TTT GAG TCC 7371 

Glu Leu Ser Cys Lys Gly Arg Cys Phe Glu Ser 

TTC GAG AGA GGG AGG GAG TGT GAC TGC GAC GCC CAA TGT 7410 
Phe Glu Arg Gly Arg Glu Cys Asp Cys Asp Ala Gin Cys 

AAG AAG TAT GAC AAG TGC TGT CCC GAT TAT GAG AGT TTC 7449 
10 Tyr Lys Lys Asp Lys Cys Cys Pro Asp Tyr Glu Ser Phe 

TGT GCA GAA GGTAAGCATC ACAGTACCAA CCAATGCTTC 7488 
Cys Ala Glu 

figlli- 
* 

15 TCAGTACAGC CAGATCT 7505 
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10 



TABLE IA 

Reading frame A - (nucleotides #1 - 7503) 
nl 

Asp Leu Phe Thr Leu Lys Asp Phe Leu Tyr Ser Phe Arg Pro 
Trp Ala Glu Met Tyr Thr Leu Leu Val Tyr Val Asp Arg Gin 
lie Cys Asn Leu End Thr He He Ser Asn Phe Gin Glu Lys 
Lys Thr Tyr Trp Leu Gly Leu Phe Lys Phe Gin Phe Leu He 
Leu Ser Ala Val Ala Asp Val Gly Arg End His He End Pro 
Glu Leu Lys Gly Lys End Pro Asp Arg Lys Arg Ser Ser Ser 
Gin Lys Arg Gly He He End Arg End Ala Asp Ser Leu Glu 
He Val Phe Thr Tyr Leu Val Leu End Asn Lys His Cys Arg 
Lys Lys He Asp Ala Tyr Asn Ser Ser Leu Lys Lys Val Thr 
He Ser Gin Thr Tyr Val Tyr Met End Ser Lys Gly Val Lys 
Phe Ser He Met He Thr Asn End Val Met Val Leu He Lys 
15 Arg Glu Leu Asp Val Val Glu Thr End Arg Asn Leu Lys Cys 

Lys Asp Met Gly Ser Asn Leu Gly Ser Gly Leu End Ser Leu 
End He Leu Gly Trp Val Thr His Ser Ser Phe Val Gly Leu 
His Leu Leu He Cys Lys Cys Gly End Leu Tyr Leu Thr Leu 
Arg His Phe Val Arg lie Lys Cys Ser Thr Tyr Gin Met Ser 
20 Ser Arg Gin Gin Asn Pro Ala Tyr Asn His Val Val His End 
Thr Cys Phe Leu Phe Phe Leu Phe Phe Phe Phe Phe End Asp 
End Val Leu Leu Leu Leu Pro Lys Leu Val Cys Gly Gly Val 
He Ser Ala His Cys Asri Leu Cys Leu Ser Gly Ser Ser Asp 
Ser Pro Ala Leu Ala Ser Arg Val Thr Gly He Thr Gly Ala 
25 Cys His His Thr Arg Leu He Phe Cys He Phe Ser Arg Glu 

Gly Val Ser Pro Cys Trp Ser Gly Trp Ser Gin Thr Pro Asp 
Leu Arg End Ser Ala His Leu Ser Leu Pro Lys Cys Trp Asp 
Cys Arg Gly Glu Pro Leu His Pro Ala End Ala Cys Phe Lys 
His He Thr Leu Val Lys Gin Leu Gly Cys Met Leu Ala He 
30 Asp End His He Gly Val Gly Cys Pro End Gly Asn His Ser 

Leu Leu Gly Val End Asp Arg End Arg Asp He Leu Val Lys 
Arg Ser Thr He Val Tyr Tyr Asn Ser Arg Cys Asp Trp Asn 
Leu He Trp Lys Gly Phe Arg Glu Val Leu Gly His Glu Arg 
Ser Leu Met Leu Val Gly Lys He Leu His Arg Tyr He Pro 
3 5 Tyr Glu Ser Gly Val Lys Pro Trp Ala Glu Leu Thr End Leu 

Cys Asn Gly Asp Gly Ser His Phe His Cys Gly Lys Glu Thr 
Lys Leu Val lie End Lys Thr Phe Ser Lys His Ala Tyr Leu 
Lys Gly Arg He End Gly Lys Tyr Gin Tyr He Phe End Ala 
Asp Phe End Asn Asn Lys Gly Ser End Lys He Gin Cys Gin 
40 End Ser Thr Cys Arg End Leu Arg Ala Cys Phe Leu Cys Ser 
Glu Lys Met Lys Leu Leu Lys Lys Cys He Phe Asp He Met 
Val End Val Tyr Lys Met Tyr Ala Thr Ser End His End Val 
Asn Phe Lys Ala Thr Phe Ala Arg Phe Asn Trp Lys Ala Glu 
Glu He Asrr Asp. Thr Ser MsfEnd lie Phe End~-Ser Cys Cys 
45 Phe Asn He Leu Val Lys Val Cys Gin Ser Tyr Leu Pro Ser 

Gin Asp Asn Leu Phe Thr Ser Leu Arg Asn End Lys Cys Ala 
Leu Ala Asn Thr Ser Lys Phe He Leu Trp End Ser Phe Cys 
Leu Met Glu Val Phe Ser Arg Cys Cys Leu Ser Leu Val Val 
Trp Ala Leu Phe Gin End Met Arg Lys Val End Phe His Arg 
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Leu 


Ser 


Leu 


Leu He 


Leu 


Ser 


Asn 


Leu Trp 


Thr 


Asn 


Trp 


Phe 




lie 


Pro 


Leu 


Tyr Leu 


His 


End 


Tyr 


He Arg 


Leu 


Pro 


Gly 


Arg 




Gly 


He 


Asn 


Ser Ser 


He 


Leu 


Tyr 


Gly End Val 


Ser 


Lys 


Leu 




Thr 


Cys 


Phe 


lie Glu 


Leu 


Asn 


Val 


Phe Ser 


Leu 


Glu 


Val 


Phe 


5 


Glu 


Lys 


Lys 


Cys Val 


Lys 


He 


Val 


Val Gly Asn Phe 


End 


Lys 




Gly 


Leu 


Cys. Leu Cys 


Asn 


Thr 


Pro 


Val Gly Ser Leu 


End 


He 




Tyr 


He 


Glu, 


Asn End 


Glu 


Lys 


Val 


Gly Thr Val 


Thr 


Pro 


End 




Cys 


Leu 


Leu 


Asn Tyr 


He 


Ser 


Val 


Thr Gin 


Val 


He 


He 


Phe 




Leu 


Lys 


lie 


Lys Phe 


End 


Pro 


Cys 


Thr Val 


Lys 


Tyr 


Met 


Pro 


10 


Tyr 


Phe 


Gly. End Ser Phe Lys 


He 


Ser Leu 


Phe 


Tyr 


He 


Gly 




Asp 


Phe 


Leu 


Arg He 


Lys 


Thr 


Val 


Asn Asn 


Val 


Ala 


Glu 


Gly 




Asp 


Leu 


He 


Cys Ser 


Ala 


End 


Asp 


End Val 


End 


Trp 


Pro 


Asn 




He 


Phe 


Tyr 


Lys He 


Lys 


Ala 


Asp 


Leu Lys 


Ser 


Cys 


Phe 


Lys 




Asn 


Gly 


Met 


Val Leu 


Leu 


Leu 


Leu 


His Phe 


End 


Ser 


Tyr 


End 


15 


Asn 


Val 


Tyr Asn Ala 


Ser 


Leu 


Ser 


End Trp 


End 


Asp 


Glu 


Arg 




Ala 


Val 


Phe 


End Tyr 


Phe 


Tyr 


Phe 


He Phe 


Ser 


Lys 


Gly 


Thr 




Tyr 


Gly 


Thr 


End Lys 


Gin 


Arg 


Trp 


His Gly Lys His 


Phe 


Pro 




Phe 


Thr 


Cys 


Cys Cys 


Cys 


Cys 


Leu 


Phe Ser 


End 


Phe 


Ser 


Lvs 




Phe 


His 


Leu- 


Lys Val 


Ala 


End 


Pro 


Ser Asn 


He 


Leu 


Leu 


Phe 


20 


Asn 


Asn 


Tyr 


Cys End 


Ser 


Phe 


Ser 


Leu Asp 


Phe 


Tyr 


Asn 


Asn 




Gly 


Asn 


He 


Phe Leu 


Lys 


lie 


Tyr 


He Cys 


Leu 


Ser 


Leu 


He 




Tyr 


Cys 


Thr 


Thr Leu 


Lys 


End 


Tyr 


He Thr 


Cys 


Ser 


Leu 


Phe 




Val 


Ser 


Ala 


Leu Glu 


Phe 


Ser 


Ser 


Phe Ser 


Pro 


Phe 


Glu 


Ser 




Phe 


Glu 


Leu 


Val End 


Arg 


He 


He 


Leu Val 


Leu 


Val 


He 


Ser 


25 


Ket 


Val 


Leu 


Ser Lys 


Val 


End 


Lys 


Val Gin 


Thr 


Val 


His 


His 




Lys 


lie 


Met Leu Glu 


Leu 


Tyr 


Ser 


He He 


Lys 


He 


Thr 


Lys 




Lys 


Glu 


Gin 


Tyr Leu 


Asp 


Asn 


Met 


End Phe 


Leu 


Pro 


He 


Cys 




End 


Thr 


He 


Cys Phe 


Leu 


Lys 


Ser 


Asn Lys 


Leu 


Leu 


He 


Asn 




Asn 


Leu 


Phe Trp His 


His 


Leu 


Asp 


He Asp Thr 


He 


Leu 


End 


30 


End 


Thr 


He, 


; Val Phe 


Gin 


Met 


Ala 


Phe Tyr End Ser 


End 


Lys 




Cys 


Ser 


Leu 


Phe Leu 


Thr Arg 


Thr 


He Tyr Pro Gly 


Met 


Asp 




lie 


Tyr 


Arg 


Cys Asp 


Cys 


His 


Phe 


Val Leu 


End 


His 


End 


Thr 




Phe 


His 


Ser 


Ser Pro 


Lys 


Thr 


Gly 


Pro Gly End 


He 


Val 


Leu 




Val 


Thr 


His 


Phe End 


Arg 


Leu 


Leu 


Lys Tyr 


He 


Gin 


End 


Asn 


35 


His 


He 


Ser 


Lys Arg 


Cys 


End 


His 


Ala He 


End 


Gly 


Asp 


End 




End 


End 


Gin 


Cys Gin 


Ala 


Leu 


Asp 


Phe Ser 


Phe 


Leu 


Phe 


Phe 




Ser 


Phe 


Cys 


Ser Pro 


Leu 


He 


Ser 


Lys He 


End 


Glu 


Asn 


Lys 




Asn 


Gin 


Asp 


Ser End 


Gin 


Leu 


End 


End Asn 


Tyr 


Val 


Ser 


Lys 




Val 


Asn 


lie 


Ser Ser 


Lys 


He 


Arg 


Asp Ser 


Tyr 


Glu 


Thr 


End 


40 


Thr 


End 


Val 


Val Thr 


Val 


Cys 


He 


Gly He 


Ser 


Cys 


Arg 


He 




Val 


End 


lie Arg Lys 


Asp Arg 


.Lys 


Thr His 


Glu 


Ser 


Lys 


Cys 




Glu 


Tyr 


Ser : He Arg Met Met 


Thr 


Thr Pro 


Val 


Ser 


Pro 


Asp 




Pro 


Ser 


Ala 


j.phe Ser 


End 


His 


Tyr 


Asp Lys 


Val 


His 


Asn 


Thr 




Gly 


Gly 


Thr" 


j?he End_ 


Th*^Ala 


P^e 


Ser Ser 


Val 


Ile-~Gln 


Leu 


45 


Val 


Val 


His 


"End Asn - 


Ser 


Val 


Thr 


Phe He 


Leu 


He" 


Val 


He 




Lys 


End 


lie 


Lys Leu 


Lys 


Leu 


Met 


Gly Leu 


Gly 


Val 


Val 


Ala 




His 


Thr 


Cys 


Asn Pro 


Ser 


He 


Leu 


Gly Gly 


Leu 


Gly 


Arg 


Gin 




lie 


Thr 


End 


Gly Leu 


Glu 


Phe 


Lys 


Ser Ser 


Leu 


Gly 


Asn 


Met 




Ala 


Lys 


Ser 


Cys Leu 


Tyr 


Thr 


Lys 


Tyr Lys 


Lys 


He 


Ser 


Gin 


50 


Val 


Trp 


End 


Cys Val 


Pro 


Val 


He 


Pro Ala 


He 


Arg 


Arg 


Leu 



3 
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Arg Gin Glu Asp He Leu Asn Pro Gly Gly Gly Gly Cys He 
Lys Leu Ser Leu His His Gly Thr Pro Ala Trp Ala Thr Glu 
End Asp Ser Val Ser Lys He Asn Lys End He Asn Lys He 
Asn Gly Tyr Asn End End Gin End Lys End Asp Arg His Leu 
He Tyr Glu Ser Lys Leu Phe Leu End Gly End Phe Pro Leu 
Phe Phe Phe Ser Phe Glu Thr Gly Ser Cys Phe Val Ser Gin 
Ser Gly Val Gin Trp His Asn Leu Gly Ser Leu Gin Pro Pro 
Pro Thr Gly Phe Lys Arg Phe Ser Cys Leu Ser Leu Pro Ser 
Ser Trp Asp Tyr Arg Pro Leu Pro Pro Cys Pro Val Asn Phe 
Cys He Phe Ser Lys Asp Arg Phe End Pro Cys Trp Pro Gly 
Trp Ser His Leu Thr Ser Gly Asp Pro Pro Ala Ser Ala Ser 
Gin Ser Ala Gly He Thr Asp Asp Phe Leu Phe Gin Lys Ser 
Ala Asn Tyr Lys Arg Ala lie Ser End Tyr Cys Leu Val Cys 
Phe Ser Cys Phe Leu Asn Leu Phe Ser Phe Ser Phe Phe Phe 
15 He Tyr Asn He Cys He Tyr He His Thr Leu Tyr He His 

Phe cys Met His Tyr Ala Leu Met Tyr Ala Lys Ser Ser Glu 
Ser Cys Pro Thr He Tyr Cys Phe He Cys He Phe Arg Leu 
Trp His Ser Trp Thr Leu Phe Phe End Asp Leu Phe Ser Val 
Cys Phe Asn He Pro Leu Trp He End Asp Ser Thr Pro Ala 
Asn Phe Tyr Ser Arg Asp Gly Phe His Cys His He Ser Ala 



20 



25 



30 



35 



40 



114897 



Val 


He 


Ser 


Gin 


Gly Gly Phe End Glv Glv Val 


Phe 


Gin Met 


Thr 


Phe 


Lvs 


Lys 


Tvr Leu Arcr Met Ser Met Leu 


Leu 


Leu Leu 


Phe 


Ala 


Tyr 


Leu 


Leu Pro Glv Lvs Pro Leu Phe 


Met 


Thr Phe 


lie 


Val 


Met 


Pro 


Ser Tvr Phe Lvs Glv Ser His 


Phe 


Ser Ser 



n5073 



Pro 


Pro 


Leu 


Thr 


Phe 


* 

End 


Lys 
Lys 


Ser 
Val 


Ser 
Tyr 


Arg 


Lys 


Lys 


Lys 


Leu 


Ser 


Val 


Leu 


End 


Thr 


Tyr 


Pro 


Gin 


Glu 


Trp 


Val 


Glu 


Thr 


Ser 


Gin 


Ala 


End 


Asn 


Lys 


Asn 


Ser 


Gly 


Trp 


He 


Val 


Tyr 


Leu 


Arg 


Leu 


Ser 


Lys 


Leu 


Gly 


Gin 


Leu 


End 


Gly 


Gin 


He 


He 


Gly 


Arg 


Tyr 


End 


Leu 


Thr 


Leu 


Glu 


Leu 


Leu 


Thr 


Ser 


Arg 


Val 


He 


Lys 


Phe 


Leu 


Cys 


Phe 


Ser 


Phe 


Leu 


Ser 


Leu 


Asn 


Pro 


Gly 


Asn 


He 


Phe 


He 


Ser 


End 


Leu 


Ser 


Thr 


Trp 


His 


Arg 


Glu 


Ala 


Ala 


Asn 


His 


Gin 


Asn 


His 


Arg 


Ser 


Val 


Thr 


Ser 


Ser 


Leu 


Gly 


Val 


Ser 


Pro 


Ser 


Thr 


Leu 


Pro 


Gly 


Phe 


Gin 


Lys 


Cys 


Val 


Val 


Lys 


Asn 


Lys 


Val 


Thr 


End 


Leu 


Arg 


Ala 


Lys 


Tyr 


Gin 


Leu 


Gly 


Leu 


Leu 


Leu 
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Thr 


Ser 


Phe Gin 


His 


Leu 


Leu Asp Gin 


End 


Asn Ser 


Leu Ser 




Pro 


Ser 


Glv Phe 


Val 


Pro 


Leu 


Val 


Arc? 


Leu 


Leu Pro 


Phe Tyr 


5 


Lys 


Val 


Val Trp 


Pro 


Tvr 


Leu Ara Gin 


Tvr 


Cvs He 


He Leu 




Asp 


Leu 


Ser Ser 


Cys 


Ala Glv Ara Cvs 


Glv Glu Glv Tvr Ser 




Ara Asp Ala Thr 


Cvs 


Asn 


Cvs Asp Tvr Asn Cvs Gin 


His Tvr 






















n5961 
* 


10 


Met 


Glu 


Cvs Cvs 


Pro 


Asp 


Phe 


Lvs 


Arcr 


Val 


Cvs Thr Ala Glv 




Lys 


Ser 


End Glu 


Arg 


Val 


Ser 


Pro 


Leu 


Ser 


Ser Asn 


Thr Ala 




Ser 


Leu 


End Val 


Pro 


Pro 


Cvs 


Thr 


Leu 


Val 


Gin Cys 


Cys Glu 




Thr 


Glu 


Pro Pro 


Leu 


Ala 


Pro 


Thr 


Cvs 


Ser 


Ala Val 


Phe Pro 




Leu 


Tyr 


His Phe 


Leu 


Cvs 


Leu 


Ser 


Asp 


Phe 


Pro Ser 


Leu Ala 


15 


End 


Met 


Leu Gin 


Arg 


He 


Val 


Pro 


Lys 


Asp 


Gly Val 


Lvs Ala 




Cys 


Asp 


Tyr Trp 


Gly 


Glv 


End 


Gly 


Glu 


Pro 


Gly Ser 


Leu Ser 




Leu 


Ala 


Gin Gly 


Tyr 


Cys 


Leu 


Val 


Pro 


Ser 


Lys Cys 


Leu He 




Leu 


Asn 


Arg Glu 


Asp 


Thr 


Val 


Ser 


Gin 


Glu 


Ser Leu 


Ser Lys 




End 


End 


Gly Val 


He 


He 


Tyr 


Asn 


Gin 


Arg 


Cys Pro 


Arg Leu 


20 


Val 


Met 


Arg Thr 


Phe 


Asn 


Ser 


Phe 


He 


Lys 


Arg He 


His Glu 




His 


Gly 


lie lie 


Leu 


Ser 


Thr 


He 


Gly 


Leu 


Ser Asp 


Val Asn 




Thr 


Phe 


Val. /Asn 


Tyr 


Leu 


Glu 


Lys 


Thr 


Leu 


End Arg 


End Ser 




Ala 


S£r 


ValSind 


End 


End 


He 


Ser 


Val 


Lys 


He End 


Arg Ser 




Glu 


Phe 


End lie 


Met 


Ser 


Gly 


Val 


Ser 


Leu 


Glu Tyr 


He Leu 


25 


Gly 


His 


Phe lie 


He 


Asn 


Thr 


Glu 


Thr 


Phe 


Pro He 


Gin Phe 




Lys 


Lys 


lie Val 


He 


Lys 


Val 


He 


End 


Lys 


Val Tyr 


Leu Leu 




End 


Asn 


Leu Asn 


Val 


Lys 


Leu 


Arg 


Asn 


He 


Leu Cys 


Asn Leu 




Cys 


He 


Gly End 


Val 


He 


He 


Ser 


Asn 


Lys 


Pro Trp 


End Val 




Pro 


Val 


Asn Ser 


Asn 


Lys 


Gly 


Phe 


Leu 


Asn 


Phe His 


Phe Leu 


30 


Asn 


GlU 


End Ser 


Ser 


Leu 


Thr 


End 


Ser 


Thr 


Leu End 


Asp End 




Ala 


Glu 


Ser Phe 


Gin 


Leu 


Tyr 


Thr 


Phe 


Pro 


Ser Leu 


Ser End 




End 


Gly 


Pro^Leu 


His 


Tyr 


Lys 


End 


Phe 


Leu 


Ser Leu 


Lys Asn 




Asp 


Pro 


Ph^-^Thr 


Trp 


Asn 


He 


Tyr 


Thr 


Tyr 


He Phe 


He Trp 




Asn 


He 


Leu' Leu 


Arg 


Ser 


Val 


His 


Ser 


He 


Thr Val 


Asn Leu 


35 


Leu 


Gin 


Ser Ser 


Val 


End 


Arg 


Gly 


Glu 


Gly 


End Arg 


Asp Phe 




He 


End 


Arg Lys 


Ser 


His 


Tyr 


Leu 


Asp 


He 


Ser Arg 


Phe Phe 




Phe 


Val 


His Thr 


Cys 


Lys 


End 


Leu 


Gly 


Gin 


His Gin 


Asp Val 




Ser 


Trp 


Ser Leu 


Glu 


Gly 


Gin 


Glu 


Glu 


Leu 


Pro Ser 


Val Thr 




He 


Phe 


Leu Phe 


End 


Leu 


Gly 


Leu 


Ser 


Pro 


Ser Gly 


He Pro 


40 


Ser 


Asn 


Phe Ser 


Trp 


Met 


Met 


Tyr 


Ser 


Lys 


Phe Gin 


He Lys 




Asp 


Leu 


Glu- Met 


Asn 


Phe 


Trp 


Lys 


Pro 


Ser 


Gin Val 


End Gly 




-&Y 


Lys 


Trp Leu 


Ser 


Asn 


Thr 


Trp 


Ala 


Trp 


Leu Bim- 


Asn Glu 




End 


Ser 


Val Thr. 


Ser 


Cys 


Phe 


Ala 


Leu 


Gly 


Arg Ala 


Phe Leu 




End 


Arg 


Pro Leu 


Leu 


End 


Val 


Leu 


Arg 


Glu 


Arg Glu 


Gly Val 


45 


End 


Leu 


Arg Arg 


Pro 


Met 


End 


Glu 


Val 


End 


Gin Val 


Leu Ser 




Arg 


Leu 


End Glu 


Phe 


Leu 


Cys 


Arg 


Arg 


End 


Ala Ser 


Gin Tyr 




Gin 


Pro 


Met Leu 


Leu 


Ser 


Thr 


Ala 


Arg 


? 
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TABLE IB 

Reading frame B - (nucleotides #2 - 7504) 

n2 
* 

lie Phe Leu Leu End Arg Thr Phe Ser Thr Pro Leu Asp His 
Gly Gin Lys Cys Thr His Tyr Trp Ser Thr End Thr Asp Lys 
Phe Val lie Ser Glu Leu End Phe Gin lie Ser Arg Arg Arg 
Lys His lie Gly Ser Gly Cys Ser Ser Ser Asn Ser End Ser 
Tyr Gin Leu Trp Leu Met Trp Glu Asp Asn lie Tyr Asn Gin 
Ser End Lys Glu Asn Ser Pro lie Glu Arg Glu Val Val Pro 
Arg Lys Gly Gly Leu Phe Arg Asp Glu Gin He Ala Ser Lys 
Leu Ser Leu His He End Ser Cys Glu He Ser Thr Val Glu 
Arg Lys End Met Leu He lie Leu Val Leu Lys Lys Ser Gin 
Ser His Lys Pro Met Tyr Thr Cys Arg Val Lys Glu End Asn 
Leu Val End End Leu Gin lie Lys End Trp Phe End Ser Lys 
Gly Ser Trp Met End Trp Lys His Asp Gly Thr End Asn Ala 
Lys Thr Trp Val Gin He Leu Val Leu Ala Ser Ser Leu Tyr 
Arg Ser Leu Asp Gly Ser His He Leu His Leu Trp Val Phe 
He Phe Leu Phe Val Asn Val Asp Asp Tyr He End Leu End 
Asp He Leu End Glu Leu Asn Ala Val Pro Thr Lys End Val 
Val Asp Ser Arg He Gin Pro Thr Thr Met Trp Phe Thr Glu 
His Val Phe Phe Phe Phe Phe Ser Phe Phe Phe Phe Glu Thr 
Glu Phe Cys Ser Cys Cys Pro Ser Trp Cys Ala Val Ala End 
Ser Gin Leu Thr Ala Thr Ser Ala Ser Gin Val Gin Val He 
Leu Leu Pro End Pro Pro Glu End Leu Glu Leu Gin Ala Pro 
Ala Thr Thr Pro Gly End Phe Phe Val Phe Leu Val Glu Arg 
Glu Phe His His Val Gly Gin Ala Gly Leu Lys Leu Leu Thr 
Ser Gly Asp Leu Pro Thr Ser Ala Ser Gin Ser Ala Gly He 
Ala Gly Val Ser His Cys Thr Arg Arg Glu His Val Leu Asn 
He End His Trp End Asn Asn Trp Ala Val Cys Trp Gin End 
Thr Asn lie End Val Leu Val Val Leu Arg Glu He He Val 
Cys Trp Gly Cys Glu Thr Asp Glu Glu He Phe End End Asn 
Val Val Gin Leu Phe Thr He He Leu Asp Val He Gly He 
End Ser Gly Lys Val Ser Gly Arg Cys End Asp Met Ser Glu 
Ala Leu Cys Trp Trp Val Arg Phe Tyr Thr Asp Thr Phe Arg 
Met Lys Val Gly End Ser Leu Gly Leu Ser End Leu Ser Ser 
Ala Met Val Met Val His He Phe He Val Gly Arg Lys Gin 
Asn End End Phe Lys Arg Pro Phe Pro Asn Met Leu He Leu 
Lys Glu Gly Ser Lys Glu Asn He Asn Thr Phe Phe Lys Gin 
He Phe Lys He He Lys Glu Ala Arg Lys Ser Ser Val Asn 
Asn Leu Pro Val Gly Asn End Glu His Val Phe Cys Ala Gin 
Arg Lys End Asn Tyr_Ser Arg Asn Val Tyr Leu Ile_ End _Irp 
Tyr Lys Tyr He £ys Cys Met Pro G±r. Ala Asn Thr Arg"Ehd 
He Ser Lys Leu Pro Leu Leu Asp Leu Thr Gly Lys Gin Lys 
Lys End Met He Pro Val Cys Glu Tyr Phe Lys Val Ala Ala 
Ser He Tyr End Ser Lys Ser Ala Arg Ala He Tyr His Leu 
Arg Thr lie Cys-Ser Leu His Leu Glu He Arg Asn Val Leu 
Trp Pro lie His Leu Asn Leu Phe Cys Gly Ser His Phe Ala 
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End Trp Lys Cys Phe Pro Asp Val Ala End Val End Ser Ser 
Gly Pro Phe Ser Asn Lys End Gly Lys Phe Asp Phe lie Gly 
Cys His Cys End Phe Cys Leu Thr Phe Gly Leu lie Gly Ser 
Ser His Cys lie Cys Thr Asp lie End Asp Ser Gin Asp Val 
5 Gly End Thr His Leu Ser Phe Thr Gly Lys Cys Gin Ser End 

Leu Ala Ser End Asn End Met Cys Phe His End Arg Cys Leu 
Lys Lys Asn Val End Arg End End Leu Glu He Phe Glu Lys 
Asp Tyr Val Tyr Ala He His Leu Leu Glu Ala Phe Glu Phe 
He Leu Arg lie Lys Lys Lys Leu Glu Gin End Leu His Asp 

10 Ala Tyr End lie Thr Phe Leu End His Arg Leu Leu Phe Ser 

Leu Arg End Asn Phe Asn Leu Ala Leu Leu Ser Thr Cys His 
Thr Leu Ala Arg Val Leu Arg End Val Tyr Ser He Leu Glu 
He Ser End Glu Leu Lys Leu Ser He Thr Trp Leu Arg Val 
Thr End Hie Val Leu His Lys He Lys Ser Asn Gly Gin He 

15 Phe Ser He Lys Leu Lys Leu He End Ser Leu Val Ser Arg 

Met Gly Trp Phe Tyr Tyr Cys Tyr He Phe Lys Val Thr Glu 
Met Cys He Met Gin Ala End Val Ser Gly Glu Met Lys Glu 
Leu Phe Ser Asp Thr Phe He Leu Phe Ser Ala Arg Val Pro 
Thr Val Pro Glu Asn Asn Asp Gly Met Glu Asn Thr Ser His 

20 Leu Pro Val Val Ala Ala Val Cys Phe Arg Asp Ser Ala Ser 

Phe He Ser Arg End- Leu Asn His Arg Thr Tyr Phe Tyr Leu 
Thr Thr He Ala Asn His Ser Val Leu He Phe He Thr Thr 
Glu lie Tyr Phe End Lys. Phe He Phe Ala End Val End Tyr 
Thr Val Gin Pro End Asn Asn He End His Val Ala Cys Leu 

25 End Val Leu Trp Asn Ser His Leu Phe Leu Leu Leu Arg Val 

Leu Ser Leu Tyr Lys Glu End Phe .Trp Tyr; Leu Leu Phe Leu 
Trp Phe Ser Arg Lys Phe Lys Lys Cys Arg Gin Phe He He 
Lys Leu . Cys Trp Ser Tyr lie Val Ser End Lys End Leu Lys 
Lys Asn Asn "He End He Thr Cys Asn Phe Cys Leu Tyr Val 

3 0 Glu Gin Phe Val Phe End Asn Gin He Asn Tyr End End lie 

Thr Tyr Phe Gly He He End Thr End He Pro Tyr Tyr Asn 
Lys Leu End Cys Phe Lys Trp His Phe He Ser Leu Glu Asn 
Val Leu Phe Phe Leu Gin Glu Leu Phe lie, Gin Val Trp Thr 
Tyr Thr Asp Val Thr Val lie Leu Cys Tyr Asn He Lys Leu 

35 Phe Thr Val Leu Pro Lys Gin Val Leu Asp Glu End Phe Leu 

Leu Leu He Phe Arg Asp Cys End Ser Thr Phe Ser Glu He 
Thr Leu Val Lys Asp Ala Ser Met Pro Ser Arg Gly He Asn 
Ser Asn Asn Ala Lys Leu Trp He Phe Leu Phe Phe Ser Phe 
Pro Phe Val Pro Leu End End Ala Lys Ser Glu Arg He Lys 

40 He Lys lie His Asp Ser Tyr Asp Glu He Met Phe Leu Lys 

End Thr Phe Pro Val Lys Tyr Glu He Leu Met Lys Pro Glu 
His Glu Trp JEnd Leu Ser Ala End Ala End Val Ala Glu Leu 
Phe Arg End ,Glu Lys Thr Gly Lys His Met Lys Ala Asn Val 
Asn^Ile GlnjSnd Glv^nd End Leu Leu Gin Tyr Leu Gin Thr 

45 Leu Arg Leu * Ser Arg Asn Thr Met Thr Arg Phe Thr Thr Leu 

Gly Ala Leu Ser Lys Leu Pro Phe Pro Leu End Tyr Asn Trp 
Leu Phe Thr Lys Thr Val Leu Leu Ser Phe End Leu End Leu 
Asn Lys Ser Asn End Asn End Trp Gly Trp Val Trp Trp Leu 
Thr Pro Val He Pro Ala Phe Trp Glu Ala End Gly Asp Arg 
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Ser Pro Glu Val Trp Ser Ser Arg Ala Ala Trp Ala Thr Trp 
Gin Asn Leu Val Ser lie Gin Asn Thr Lys Lys Leu Ala Arg 
Cys Gly Ser Val Cys Leu End Ser Gin Leu Phe Gly Gly End 
Gly Lys Arg He Ser End He Gin Glu Val Glu Val Ala Leu 
Ser End Ala Cys He Met Ala Leu Gin Pro Gly Gin Gin Ser 
Glu Thr Leu Ser Gin Lys End lie Asn Lys End He Lys Leu 
Met Val Thr He Asn Ser Asn Lys Ser Arg Thr Asp Thr End 

n4202 



Ser Met Lys Val Ser Phe Ser Cvs Lys Asp Asp Phe T.em PhP 

Phe Phe Phe Leu Leu Arg Gin Gly Leu Ala Leu Ssr Pro Ser 

Leu Glu Cys Ser Glv Thr He T ,eu Ala His Cvs Asn T«« wis 

Leu Arg Glv Ser Ser Asp Ser Pro Val Ser A1, a Ser Arg Val 

Ala Gly He Thr Glv Pro Cys His His Ala Arg T.eu He Phe 

Val Phe Leu Val Lvs Thr Glv Phe Asn His Val Glv Bin Ala 

n4465 n4466 
* * 

Glv Leu Thr End Pro Gin Val He Arg Pro Pro Gin Prn Prn 
Lys Val Leu Glu Leu Gin Me t He Ser Tvr Phe Arg Asn Leu 
Pro Thr He Lvs Glu Gin Ser Leu Asp Thr Val Leu Ssr Ala 
Ser Leu Ala Phe Ser Thr S er Ser His Ser Leu Phe s B r t^h 
Tvr He He Tvr Val Phe He Tvr He His Tvr Tie Tvr Tie 
Phe Val Cys He Met His Se r Cvs Thr Gin Lvs Val Leu T.ys 
Val Val Leu Gin P he Thr Val Leu Phe Ala Tvr Ser Asp Phe 
Gly He Pro Glv Leu Tvr Ser Phe Lvs Tie Cvs Phe Gin Cvs 
Val Ser Thr Phe Leu Cvs Gly P he Arg Thr Val His T,gn Prn 
He Ser He Pro Glv Met Asp Ser He Val Thr Phe Leu Gin 

n4915 
* 

Ser Phe Leu Arq Glu Glv Ph e Lvs Val Val Phe Ser Lvs End 
Leu Leu Lys Asn He End Glu End Val Cys Phe Cys Tyr Cys 
Leu Leu He Cys Cys Leu Glu Asn Leu Cys Ser End Pro Leu 
End Leu Cys Pro Val He Leu Arg Val Leu Thr Ser Ala Val 
Val Val Ser lie Cys Leu Pro End His Phe Lys Asn Pro Val 
He Pro Thr Asn End Gly Arg Lys Ser Ser Val Lys Phe Met 
Leu Val Gin Pro Thr Cys Phe Lys Leu He Leu Arg Asn Gly 
Arg Gin Ser Asp Arg Trp Lys His Leu Arg Leu Lys He Arg 
Tyr He Arg Val Gin He Leu Asp Gly Leu Phe Thr End Gly 
Asp His Lys Met Tyr Tyr Leu Asn End Asp Asn' Tyr Lys Gly 
Lys Arg Val His Tyr Lys End Leu Gly Gly He Asn End Pro 
Met Met Pro He Tyr Trp Asn Tyr End Leu Val Val End End 
EiTQ-Ala-Sfer Cys. Gin Asn Pi5@ Cys Val Ser Ala Phe Leu-Ala 
Leu Lys Lys Gly Tyr End He Leu Glu Thr Phe Ser End Ala 
Leu Val He Met Tyr Asp Ser Ala Pro Gly He Glu Lys Leu 
Asn Lys Trp Gin Leu Leu He He Lys Thr Thr Asp Arg End 
End Leu End Leu Pro Pro Leu Pro End Glu Tyr His Pro Leu 
End Ala Lys Pro Leu Leu Cys Leu Val Phe Lys Asn Val Phe 
He He Ser Val End End Arg He Lys End His Asn End Asp 
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His Leu Arg His Gin Pro Asn He Ser Trp^Ala Tyr Cys End 
His His Ser Asn Thr Phe Ser lie Asn Lys* (nucleotide 
#5743)... 

[Reading frame for nucleotides #5742 - #5961 
5 (Exon I) has been determined as reading frame A] 

(nucleotide 5963) Val* Ser Pro Glu Ser Gly Cys Leu Leu 







tZl n 

V7J.I1 


Ala Thr 


.Lieu 




Val 


cys 


ulu 


oer 


fro 


Leu 


iiia 


Pro 




oer 


Lrys 


Asn 


Ala 


Veil 


Arg 


Leu 


DCL 


Leu 


Pro 


Leu 


rilS 


Pro 


Leu 




Ala 


Val 


Leu 


Phe 


OC1 


illb 


^*ys 


1X6 


1X11 






Ala 


Juno. 


vax 


1U 


inr 


rue 


Leu 


His 


Leu 


tin e* 


Lys 


Cys 


Cys 


Ser 


Ala 


Leu 


Tyr 


T ah 

Leu 






net 


Val 


Ser 


Lys 


Leu 


Val 


xie 


lie 


laiy 


UlU 


trxy 


Lys 


Hi «r 

GJ_y 




Ser 


Leu 


Glu 


Val 


Cys 


HIS 


Leu 


TT 4 — 

rilS 


Lys 


val 


Tl . 

lie 


Val 


End 


Pne 






val 


Asn 


:Val 


lieu 


irne 




lie 


ulU 


Lys 


lie 


Leu 


±J1CI 


va± 




Arg. 


o a >*» 
Del 


His 


End 


val 


Asn 


Ser 


Lys 


vjIU 


Jbna 


Leu 


Fne 


inr 


lie 


13 




A en 
A5p 


Val 


Leu 


Asp 


&na 


Hiii a. 


itillCl 


pi If 


Leu 


Ser 


ixe 


vax 


Ser 






T Are 

jjys 


Glu 


Tyr 


net. 


Dei 






riii a 


O 'v* 

oer 


Cys 


Leu 


Pro 


Leu 






it i it; 


Leu 


Met 


JjcU. 


rp V» v* 
1 111 


riiy 


liSU 




lie 


inr 


Trp 


Lys 


Arg 




His 


Phe 


Lys. 


Asp 




V CL J- 




veil 


STL 1*2 






kJl u 


file? 


Cl n 

vriri 




Leu 


Lys 


Ser 


Glu 


Gly. 


Leu 


Asn 


Phe 


Lys 


Leu 


End 


Val 


Val 


Phe 


20 


His 


Leu 


Asn 


He 


Fhe 


Trp 


Ala 


lie 


Leu 


Leu 


Leu 


Thr 


Gin 


Lys 




His 


Phe 


Leu 


Phe 


Asn 


Leu 


Arg 


Lys 


Leu 


End 


Leu 


Lys 


Leu 


Phe 




Glu 


Arg 


Cys 


He 


Cys 


Phe 


Arg 


He 


End 


Met 


End 


Ser 


End 


Gly 




lie 


Tyr 


Cys 


Ala 


He 


Tyr 


Val 


Leu 


Asp 


Arg 


End 


Leu 


Phe 


Gin 




He 


Ser 


Leu 


Gly 


Lys 


Ser 


Leu 


Leu 


Thr 


Leu 


Thr 


Lys 


Ala 


Phe 


25 


End 


He 


Phe 


lie 


Phe 


End 


Met 


Ser 


Ser 


Pro 


Pro 


End 


His 


Ser 




Pro 


His 


Cys 


Lys 


lie 


Arg 


Leu 


Lys 


Ala 


Phe 


Asn 


Tyr 


Thr 


Pro 




Ser 


Pro 


Val 


Cys 


His 


Asn 


Lys 


Gly 


Leu 


Cys 


He 


Thr 


Asn 


Asp 




Phe 


End 


Ala 


Ser 


Lys 


Met 


Thr 


His 


Leu 


Arg 


Gly 


lie 


Tyr 


He 




His 


He 


Tyr Leu 


Tyr 


Gly 


He 


Tyr 


Cys 


End 


Asp 


Leu 


Tyr 


He 


30 


Gin 


Leu 


Leu End 


He 


Tyr 


Tyr 


Lys 


Ala 


Val 


Cys 


Glu 


Glu 


Glu 




Lys 


Asp 


Glu 


Glu 


He 


Ser 


Tyr 


Glu 


Gly 


Tyr 


Leu 


Thr 


He 


End 




Thr 


Phe 


Pro -Asp 


Phe 


Ser 


Leu 


Ser 


He 


His 


Val 


Asn 


Asn 


Ser 




Gly 


Ser 


lie Arg 


Met 


Ser 


Leu 


Gly 


Val 


Trp 


Lys 


Gly 


Lys 


Arg 




Ser 


Cys 


Pro 


Gin 


Ser 


Pro 


Tyr 


Phe 


Phe 


Phe 


Asp 


Leu 


Gly 


Cys 


35 


Leu 


His 


Leu 


Gly 


Tyr 


His 


Leu 


He 


Phe 


Pro 


Gly 


End 


Cys 


Thr 




Pro 


Asn 


Phe 


Lys 


End 


Lys 


Thr 


End 


Lys 


End 


Thr 


Phe 


Gly 


Asn 




Leu 


Val 


Lys 


Ser 


Lys 


Val 


Gly 


Asn 


Gly 


Cys 


Gin 


He 


Arg 


Gly 




Pro 


Gly 


Phe 


Thr 


Met 


Asn 


Asn 


Leu 


End 


Leu 


Leu 


Val 


Leu 


Leu 



rt7340 
40 * ' 

Trp Val Glu Leu Ser Cvs Lvs Gly ArcL Cys Phe Glu Ser Phe 
Glu Ara GlvAro.Glu Cvs Asp Cys Asn Ala Gin Cvs Lys Lys 

n7459 
* 

45 Tyr Asp Lvs Cys Cys Pro Asp Tyr Glu Ser Phe Cvs Ala Glu 

Gly Lys His His Ser Thr Asn Gin Cys Phe Ser Val Gin Pro 
Asp ? 
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TABLE IC 

Reading frame C - (nucleotides #3 - 7505) 



40 



45 



n3 
* 



5 Ser Phe Tyr Ser Glu Gly Leu Ser Leu Leu Leu End Thr Met 
Gly Arg Asn Val His He He Gly Leu Arg Arg Gin Thr Asn 
Leu End Ser Leu Asn Tyr Asn Phe Lys Phe Pro Gly Glu Glu 
Asn He Leu Ala Arg Val Val Gin Val Pro He Pro Asn Pro 
He Ser Cys Gly End Cys Gly Lys He Thr Tyr lie Thr Arg 
10 Ala Lys Arg Lys He Ala Arg End Lys Glu Lys End Phe Pro 
Glu Lys Gly Asp Tyr Leu Glu Met Ser Arg End Pro Arg Asn 
Cys Leu Tyr He Phe Ser Pro Val Lys End Ala Leu End Lys 
Glu Asn Arg Cys Leu End Phe End Ser End Lys Ser His Asn 
Leu Thr Asn Leu Cys He His Val Glu End Arg Ser Lys He 
15 End Tyr Asn Asp Tyr Lys Leu Ser Asp Gly Phe Asp Gin Lys 
Gly Ala Gly Cys Ser Gly Asn Met Thr Glu Leu Lys Met Gin 
Arg His Gly Phe Lys Ser Trp Phe Trp Pro Leu Val Phe He 
Asp Pro Trp Met Gly His Thr Phe Phe He Cys Gly Ser Ser 
Ser Ser Tyr Leu End Met Trp Met He He Phe Asn Phe Lys 
2 0 Thr Phe Cys Glu Asn. End Met Gin Tyr Leu Pro Asn Glu End 
End Thr Ala Glu Ser Ser Leu Gin Pro Cys Gly Ser Leu Asn 
Met Phe Ser Phe Phe Ser. Phe Leu Phe Phe Phe Leu Arg Leu 
Ser Phe Ala Leu Val Ala Gin Ala Gly Val Arg Trp Arg Asp 
Leu Ser Ser Leu Gin Pro Leu Pro Leu Arg Phe Lys End Phe 
25 Ser Cys Leu Ser Leu Pro Ser Asn Trp Asn Tyr Arg Arg Leu 
Pro Pro His Pro Ala Asn Phe Leu Tyr Phe End End Arg Gly 
Ser Phe Thr Met Leu Val Arg Leu Val Ser Asn Ser End Pro 
Gin Val He Cys Pro Pro Gin Pro Pro Lys Val Leu Gly Leu 
Gin Gly End Ala Thr Ala Pro Gly Val Ser Met Phe End Thr 
30 Tyr Asn He Gly Glu Thr Thr Gly Leu Tyr Val Gly Asn Arg 
Leu Thr Tyr Arg Cys Trp Leu Ser Leu Gly Lys Ser End Ser 
Val Gly Gly Val Arg Gin Met Lys Arg Tyr Phe Ser Lys Thr 
End Tyr Asn Cys Leu Leu End Phe End Met End Leu Glu Phe 
Asp Leu Glu Arg Phe Gin Gly Gly Ala Arg Thr End Ala Lys 
35 Pro Tyr Val Gly Gly End Asp Phe Thr Pro He His Ser Val 
End Lys Trp Gly Lys Ala Leu Gly End Ala Asp Leu Ala Leu 
Gin Trp End Trp Phe Thr Phe Ser Leu Trp Glu Gly Asn Lys 
He Ser Asn Leu Lys Asp Leu Phe Gin Thr Cys Leu Ser End 
Arg Lys Asp Leu Arg Lys He Ser He His Phe Leu Ser Arg 
Phe Leu Lys End End Arg Lys Leu Glu Asn Pro Val Ser He 
He Tyr Leu End Val Thr Glu Ser Met Phe Ser Val Leu Arg 
Glu Asn Glu Thr Thr Gin Glu Met Tyr He End Tyr Asn Gly 
He Ser He End Asn Val Cys His Lys Leu Thr Leu_Gly Lys 
Phe Gin Ser Tyr -Leu-Cys End He End- Leu Gl-n Ser Arg"Krg 
Asn Lys End Tyr Gin Tyr Val Asn He Leu Lys Leu Leu Leu 
Gin Tyr Thr Ser Gin Ser Leu Pro Glu Leu Phe Thr He Ser 
Gly Gin Phe Val His Phe Thr End Lys Leu Glu Met Cys Ser 
Gly Gin Tyr He End He Tyr Phe Val Val Val He Leu Leu 
Asn Gly Ser Val Phe Gin Met Leu Pro Lys Ser Ser Arg Leu 
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Gly 


Pro 


Phe Pro 


He 


Asn 


Glu 


Glu 


Ser 


Leu 


He 


Ser 


End 


v ax 




v ai 


xxxx 


Val Asp 


Phe 


Val 


End 


Pro 


Leu Asp 


End 


Leu 


Val 


ms 




Pro 


Thr 


Val Phe 


Ala 


Leu 


He 


Tyr 


Lys Thr Pro Arg Thr 


Trp 




Asp 


Lys 


Leu lie 


Tyr 


Pro 


Leu Arg 


Val 


Ser 


Val 


Lys Val 


Asn 


5 


Leu 


Leu 


His Arg lie Lys Cys Val 


Phe He Arg Gly 


Val 


End 




Lys 


Lys 


Met Cys 


Lys Asp 


Ser 


Ser 


Trp 


Lys 


Phe 


Leu Lys 


Arg 




T1 o 
xxc 


JTifcJ L. 


Phe Met 


Gin Tyr Thr 


Cys 


Trp 


Lys 


Pro 


Leu 


Asn 


Leu 




Tyr 


End. 


Glu Leu Arg Lys 


Ser 


Trp 


Asn 


Ser 


Asn 


Ser 


Met 


Met 




Leu 


He 


Lys Leu 


His 


Phe 


Cys 


Asp 


Thr Gly Tyr Tyr Phe 


Pro 


10 


End 


Asp 


Lys lie 


Leu 


Thr 


Leu 


His 


Cys 


End 


Val 


His 


Ala 


He 




Leu 


Trp 


Leu Glu 


Phe End Asp 


Lys 


Ser 


He 


Leu 


Tyr Trp 


Arg 




Phe 


Leu 


Lys Asn End Asn Cys Gin 


End Arg Gly End Gly 


End 




Leu 


Asp 


Leu Phe 


Cys 


He 


Arg 


Leu 


Ser 


Leu 


Met 


Ala 


Lys 


Tyr 




Phe 


Leu 


End Asn 


End 


Ser 


End 


Phe 


Lys 


Val 


Leu 


Phe 


Gin 


Glu 


15 


Trp 


Asp 


Gly Phe 


He 


He 


Val 


Thr 


Phe 


Leu 


Lys 


Leu 


Leu 


Lys 
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Cys 


Val 


End Cys 


Lys 


Pro 


Lys 


Leu 


Val 


Val 


Arcr End Lys 


Ser 




Cvs 


Phe 


Leu He 


Leu 


Leu 


Phe 


Tyr 


Phe 


Gin 


Gin Glv Tvr 


Leu 




Arcr Tvr 


Leu Lys 


Thr- Thr Met 


Ala 


Tro 


Lys 


Thr 


Leu 


Pro 


He 




Tyr 


Leu 


Leu Leu 


Leu Leu 


Ser 


Val 


Phe 


Val 


He 


Gin 


Gin 


Val 


























n27 32 




Ser 


Ser 


Gin Glv 


Ser 


Leu 


Thr 


He 


Glu 


His 


Thr 


Phe 


He 


End 




Gin 


Leu 


Leu Leu 


lie 


He 


Gin 


Ser 


End 


Phe 


Leu 


End 


Gin 


Arg 




Lys 


Tyr 


lie Ser 


Lys 


Asn 


Leu 


Tyr 


Leu 


Leu 


Glu 


Phe 


Asn 


He 




Leu 


Tyr 


Asn Leu 


Lys 


He 


He 


Tyr 


Asn 


Met 


End 


Pro 


Val 


Cys 




Lys 


Cys 


Phe Gly 


lie 


Leu 


He 


Phe 


Phe 


Ser 


Phe 


End 


Glu 


Phe 




End 


Ala 


Cys lie 


Lys 


Asn 


Asn 


Phe 


Gly 


Thr 


Cys 


Tyr 


Phe 


Tyr 


30 


Gly 


Ser 


Leu Glu 


Ser 


Leu 


Lys 


Ser 


Ala 


Asp 


Ser 


Ser 


Ser 


End 




Asn 


Tyr 


Ala Gly 


Ala 


He 


End 


Tyr 


His 


Lys 


Asn 


Asn 


End 


Lys 




Arg 


Thr 


lie Phe 


Arg 


End 


His 


Val 


lie 


Phe 


Ala 


Tyr 


Met 


Leu 




Asn 


Asn 


Leu Phe 


Phe 


Lys 


lie 


Lys 


End 


He 


He 


Asp 


Lys 


End 




Leu 


He 


Leu Ala. 


Ser 


Phe 


Arg 


His 


Arg 


Tyr 


His 


He 


He 


He 


35 


Asn 


Tyr 


Ser Val 


Ser 


Asn 


Gly 


He 


Leu 


Leu 


Val 


Leu 


Lys 


Met 




Phe 


Ser 


Phe Ser 


Tyr 


Lys 


Asn 


Tyr 


L&u 


Ser 


Arg 


Tyr 


Gly 


His 




lie 


Gin 


Met End 


Leu 


Ser 


Phe 


Cys 


Val 


He 


Thr 


Leu 


Asn 


Phe 




Ser 


Gin 


Phe Ser 


Gin 


Asn 


Arg 


Ser 


Trp 


Met 


Asn 


Ser 


Ser 


Cys 




Tyr 


Ser 


Phe Leu 


Glu 


Thr 


Val 


Lys 


Val 


His 


Ser 


Val 


Lys 


Ser 


40 


His 


End 


End j/ys 


Met 


Leu 


Ala 


Cys 


His 


Leu 


Gly 


Gly 


Leu 


He 




Val 


Thr 


Met Pro 


Ser 


Phe 


Gly 


Phe 


Phe 


Phe 


Ser 


Phe 


Leu 


Phe 




Leu 


Lieu 


Phe Pro 


Phe _Asp 


Lys _£ln— nsn 


Lerg. Arg 


Glu 


End 


Lys-^ 




Ser 


Arg 


Phe Met 


.Thr 


Val 


Met 


Met 


Lys 


Leu 


Cys 


Phe 


End 


Ser 




Lys 


His 


Phe Gin 


End 


Asn 


Thr 


Arg 


Phe 


Leu 


End 


Asn 


Leu 


Asn 


45 


Met 


Ser 


Gly Asn 


Cys 


Leu 


His 


Arg 


His 


Lys 


Leu 


Gin 


Asn 


Cys 




Leu 


Asp 


Lys Lys 


Arg Gin 


Glu 


Asn 


Thr 


End 


Lys 


Gin 


Met 


End 




He 


Phe 


Asn Lys 


Asn Asp 


Asp 


Tyr 


Ser 


Ser 


He 


Ser 


Arg 


Pro 




Phe 


Gly 


Phe Leu 


Val. Thr 


Leu 


End 


Gin 


Gly 


Ser 


Gin 


His 


Trp 
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Gly His Phe Leu Asn Cys Leu Phe Leu Cys Asp Thr lie Gly 
Cys Ser Leu Lys Gin Cys Tyr Phe His Phe Asn Cys Asp End 
He Asn Gin lie Lys lie Asn Gly Ala Gly Cys Gly Gly Ser 
His Leu End Ser Gin His Ser Gly Arg Pro Arg Glu Thr Asp 
His Leu Arg Ser Gly Val Gin Glu Gin Pro Gly Gin His Gly 
Lys He Leu Ser Leu Tyr Lys He Gin Lys Asn End Pro Gly 
Val Val Val Cys Ala Cys Asn Pro Ser Tyr Ser Glu Ala Glu 
Ala Arg Gly Tyr Leu Glu Ser Arg Arg Trp Arg Leu His End 
Ala Glu Leu Ala Ser Trp His Ser Ser Leu Gly Asn Arg Val 
Arg Leu Cys Leu Lys Asn Lys End He Asn Lys End Asn End 
Trp Leu Gin Leu He Ala He Lys Val Gly Gin Thr Pro Asn 
Leu End Lys End Ala Phe Pro Val Arg Met He Ser Ser Phe 
Phe Phe Phe Phe End Asp Arg Val Leu Leu Cys Leu Pro Val 
Trp Ser Ala Val Ala Gin Ser Trp Leu Thr Ala Thr Ser Thr 
Tyr Gly Val Gin Ala He Leu Leu Ser Gin Pro Pro Glu End 
Leu Gly Leu Gin Ala Pro Ala Thr Met Pro Gly End Phe Leu 
Tyr Phe End End Arg Gin Val Leu Thr Met Leu Ala Arg Leu 
Val Ser Pro Asp Leu Arg End Ser Ala Arg Leu Ser Leu Pro 
Lys Cys Trp Asn Tyr Arg End Phe. Leu He Ser Glu He Cys 
Gin Leu End Lys Ser Asn Leu Leu He Leu Ser Cys Leu Leu 
Leu Leu Leu Ser Gin- Pro Leu Leu He Leu Phe Phe Leu Tyr 
He End Tyr Met Tyr Leu Tyr Thr Tyr Thr He Tyr Thr Phe 
Leu Tyr Ala Leu Cys Thr His Val Arg Lys Lys Phe End Lys 
Leu Ser Tyr Asn Leu Leu Phe Tyr Leu His He Gin Thr Leu 
Ala Phe Leu Asp Ser He Leu Leu Arg Phe Val Phe Ser Val 
Phe Gin His Ser Phe Val Asp Leu Gly Gin Tyr Thr Cys Gin 
Phe Leu Phe Gin Gly Trp He Pro Leu Ser His Phe Cys Ser 
His Phe Ser Gly Arg Val Leu Arg Trp Cys Phe Pro Asn Asp 
Phe End Lys He Phe Glu Asn Glu Tyr Ala Phe Val He Val 
Cys Leu Phe Val Ala Trp Lys Thr Ser Val His Asp Leu Tyr 
Ser Tyr Ala Gin Leu Phe End Gly Phe Ser Leu Gin Gin Trp 
Leu End Ala Ser Ala Ser Pro Asp He Leu Lys He Gin Leu 
Ser Pro Pro He Lys Glu Glu Lys Ala Gin End Ser Leu Cys 
Trp Cys Asn Gin Arg Ala Leu Asn Leu Ser Ser Gly Met Gly 
Gly Lys Ala Thr Gly Gly Asn He Ser Gly Leu Lys End Asp 
He Ser Glu Phe Lys Phe Trp Met Asp Cys Leu Leu Lys Val 
Thr He Lys Cys He He End He Arg Thr He He Arg Val 
Lys Glu Phe Thr Thr Asn Asn Trp Glu Val Leu He Asn Leu 
End Cys Gin Tyr Thr Gly Thr He Asn End Ser Cys Asp Asn 
Glu Gin Val Val Lys He Ser Val Phe Gin Leu Ser End Leu 
End Lys Arg Ala Thr Lys Ser Trp Lys His Phe His Lys His 
End End End Cys Met Thr Gin His Leu Ala End Arg Ser Ser 
He Asn Gly Asn Cys End Ser Ser Lys Pro Gin He Gly Ser 
Ser Cys Ser Cys His Leu Phe Leu Arg Ser He Thr Leu Phe 
Arg- Gin Ser-HisTyr Ffce Ala Trp Phe Ser 1 Lys Met Cys Leu 
Ser Ser Gin Ser Ser Glu Glu End Ser Asp He He Glu He 
Thr End Asp He Ser Gin He Ser Ala Gly Pro He Ala Asp 
He He Pro Thr Pro Ser Arg Ser He* (nucleotide 5741) . . . 
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Coding sequence for Exon I occurs from 
nucleotides #57.42 - #5961 

(nucleotide 5961) Gly** End Val Leu Arg Ala Gly Val Ser 
Ser Val Lys Gin His Cys Glu Ser Val Ser Pro Pro Leu His 
5 Pro Arg Ala .Met Leu End Asp End Ala Ser Pro Cys Thr His 

Leu Gin Cys Cys Phe Pro Thr Val Ser Leu Pro Leu Leu Lys 
End Leu Ser Phe Thr Cys lie Asn Val Ala Ala His Cys Thr 
End Gly Trp Cys Gin Ser Leu End Leu Leu Gly Arg Val Arg 
Gly Ala Trp Lys Ser Val Thr Cys Thr Arg Leu Leu Ser Ser 

10 Ser Lys End Met Ser Tyr Phe Lys End Arg Arg Tyr Cys Glu 

Ser Gly Val lie Lys End lie Val Arg Ser Asn Tyr Leu Gin 
Ser Glu Met Ser Ser Thr Ser Asn Glu Asp Phe Gin End Phe 
His Gin Lys Asn Thr End Ala Trp Asn Asn Leu Val Tyr His 
Trp Thr Phe End Cys End His Val Cys Lys Leu Pro Gly Lys 

15 Asp Thr Leu Lys lie Lys Cys End Cys Leu Met Met Asn Phe 

Ser End Asn Leu Lys Val End lie Leu Asn Tyr Glu Trp Cys 
Phe Thr End lie Tyr Phe Gly Pro Phe Tyr Tyr End His Arg 
Asn lie Ser Tyr Ser lie End Glu Asn Cys Asn End Ser Tyr 
Leu Lys Gly Val Phe Ala Leu Glu Phe Glu Cys Glu Val Glu 

20 Glu Tyr lie Val Gin Phe Met Tyr Trp lie Gly Asn Tyr Phe 

Lys End Ala Leu Val Ser Pro Cys End Leu End Gin Arg Leu 
Phe Lys Phe Ser Phe Phe. Lys End Val Val Leu Pro Asn lie 
Val His Thr ^al Arg Leu Gly End Lys Leu Ser Thr lie His 
Leu Pro Gin Phe Val lie lie Arg Ala Ser Ala Leu Gin Met 

25 , lie Phe Lys Pro Gin Lys End Pro lie Tyr Val Glu Tyr lie 
Tyr lie Tyr lie Tyr Met Glu Tyr lie Ala Glu He Cys Thr 
Phe Asn Tyr Cys Glu Ser He Thr Lys Gin Cys Val Lys Arg 
Arg Arg Met. -".Lys Arg Phe His Met Lys Ala He Ser Leu Ser 
Arg His Phe Pro He Phe Leu Cys Pro Tyr Met End He Thr 

30 Arg Ala Ala £er Gly Cys Leu Leu Glu Ser Gly Arg Ala Arg 

Gly Val Ala Leu Ser His His He Ser Phe Leu Thr Trp Ala 
Val Ser lie Trp Asp Thr He End Phe Phe Leu Asp Asp Val 
Leu Gin He Ser Asn Lys Arg Leu Arg Asn Glu Leu Leu Glu 
Thr End Ser Ser Leu Arg Trp Glu Met Ala Val Lys Tyr Val 

35 Gly Leu Ala Ser Gin End He He Cys Asn Phe Leu Phe Cys 

Ser Gly End- Ser Phe Pro Val Lys Ala Ala Ala Leu Ser Pro 
Ser Arg Glu Gly Gly Ser Val Thr Ala Thr Pro Asn Val Arg 
Ser Met Thr S«=>r Ala Val Pro He Met Arg Val Ser Val Gin 
Lys Val Ser lie Thr Val Pro Thr Asn Ala Ser Gin Tyr Ser 

40 Gin He ? 
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TABLE II 

Partial Clone Containing Exon III 

TTTTATAGCA ATGCATCCTT AGCTTAAATG GAGTATTCGT 

5 GTTGAGCTGG AGCCTGCCTG CACTGGCTGT CACCAGCATC 

TACTCTTGAA GCTAGATGCA TCTGTGCTTT TCACAGTTAG 

AGCTGCTGAT GAACATAAAC AAGATGTTAA CTGACTTGTC 

TTACTTGGCC TCA GTG CAT AAT CCC ACA TCA CCA CCA TCT 
0 Val His Asn Pro Thr Ser Pro Pro Ser 





TCA 


AAG 


AAA 


GCA 


CCT 


CCA 


CCT 


TCA 


GGA 


GCA 


TCT 


CAA 


ACC 


239 




Ser 


Lys 


Lys 


Ala 


Pro 


Pro 


Pro 


Ser Gly 


Ala 


Ser 


Gin 


Thr 


15 


ATC 


AAA 


TCA 


ACA 


ACC 


AAA 


CGT 


TCA 


CCC 


AAA 


CCA 


CCA 


AAC 


278 




He 


Lys 


Ser 


Thr 


Thr. 


Lys 


Arg 


Ser 


Pro 


Lys 


Pro 


Pro 


Asn 




AAG 


AAG 


AAG 


ACT 


AAG 


AAA 


GTT 


ATA 


GAA 


TCA 


GAG 


GAA 


ATA 


317 


20 


Lys 


Lys 


Lys 


Thr 


Lys 


Lys 


Val 


He 


Glu 


Ser 


Glu 


Glu 


He 


ACA 


GAA 


GGT 


AGG 


AAG 


ATG 


ACA 


GAT 


ATA 


ATC 


AAA 


GGA 


GCT 


356 




Thr 


Glu 


Gly 


Arg 


Lys 


Met 


Thr 


Asp 


He 


lie 


Lys 


Gly Ala 


25 


TTC 


TTA 


GAT 


GAA 


GTA 


ACT 


TGT 


AGG 


TGACTGCTTA 






390 


Phe 


Lys 


Asp 


Glu 


Val 


Thr 


Cys 


Arg 













30 



TCTAAGCCCA 


TTCTCAGAGA 


ACAGGGTAAT 


CTTAGGAATC 


430 


ATGAGCCTCA 


TTACACTCGA 


AGGTTTTAGA 


CTTTGCTTTT 


470 


AAGTAAAGTT 


TAAGACAAAG 


TATAAACTCT 


CAGCTCTTTC 


500 


TGTATTTACC 


AAACCCAGCA 


TGAGTCTGGG 


TTAAAACAAA 


540 


TCAGAAGGGA 


CAAATCTTAC 


TATAAAAAAC 


AAAAACCACC 


580 


CCATGATTTT 


CTTTGTAGAA 


TAATTTGATT 


CTGTGTTTTG 


620 


GAGGATATGG 


GAAAGTTGAG 


AGATACTAGT 


AATACTGCTA 


660 


GTATCTGTGA 


TAAGCCCAGG 


TGCCTTGCTT 


TTAACTGACA 


700 


GATTAAAAGG 


CAGTTGGTCA 


TATTACTAAT 


AAAAGCAAAA 


740 



40 
80 
120 
160 
200 
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TCCAGATACT TGTAGACTAG TAAATAGCAC TTCTTGCTGT 780 

GTTTAGACTG GTGGTTCTTT TTTGTTTTAA ATCACAGTTG 820 

GTGTGATCC 829 
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TABLE III 

Putative Partial cDNA <i meg-cSF 
(1) 

AAT TCT CTC TCA CCA AGTGGC TTT GTC CCC CTC GTT AGA TTG 
5 Asn Ser Leu Ser Pro SerGly Phe Val Pro Leu Val Arg Leu 

(2<J 

CTC CCT TTC TAT AAA GTGGTT TGG CCA TAT TTA CGC CAG TAT 
Leu Pro Phe Tyr Lys Valval Trp Pro Tyr Leu Arg Gin Tyr 

(30) (40) 
10 TGT ATA ATT TTA GAT TTATCA AGC TGT GCA GGG AGA TGT GGG 

Cys lie He Leu Asp LeuSer Ser Cys Ala Gly Arg Cys Gly 

(50) 

GAA GGG TAT TCT AGA GATGCC ACC TGC AAC TGT GAT TAT AAC 
Glu Gly Tyr Ser Arg Asp&la Thr Cys Asn Cys Asp Tyr Asn 

15 (60) (70) 

TGT CAA CAC TAC ATG GAGTGC TGC CCT GAT TTC AAG AGA GTC 
Cys Gin His Tyr Met GluCys Cys Pro Asp Phe Lys Arg Val 



20 



(80) 

TGC ACT GCG GAG CTT TCCfGT AAA GGC CGC TGC TTT GAG TCC 
Cys Thr Ala Glu Leu SerCys Lys Gly Arg Cys Phe Glu Ser 

(90| 

TTC GAG AGA GGG AGG GAGTPGT GAC TGC GAC GCC CAA TGT AAG 
Phe Glu Arg Gly Arg Glueys Asp Cys Asp Ala Gin Cys Lys 

(100) (110) 
25 AAG TAT GAC AAG TGC TGTCCC GAT TAT GAG AGT TTC TGT GCA 

Lys Tyr Asp Lys Cys CysPro Asp Tyr Glu Ser Phe Cys Ala 

(120) 

GAA GTG CAT AAT CCC ACATCA CCA CCA TCT TCA AAG AAA GCA 
Glu Val His Asn Pro ThrSer Pro Pro Ser Ser Lys Lys Ala 

30 ( 13 °) (140) 

CCT CCA CCT TCA GGA GCAfCT CAA ACC ATC AAA TCA ACA ACC 
Pro Pro Pro Ser Gly AlaSer Gin Thr lle_ Lys Ser Thr Thr 

(150) 

AAA CGT TCA CCC AAA CCA€CA AAC AAG AAG AAG ACT AAG AAA 
35 Lys Arg Ser Pro Lys ProPro Asn Lys Lys Lys Thr Lys Lys 
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(160) 

GTT ATA GAA TCA GAG GAA ATA ACA GAA GGT AGG AAG ATG ACA 
Val He Glu Ser Glu Glu He Thr Glu Gly Arg Lys Met Ttir 



(170) (180) 
GAT ATA ATC AAA GGA GCT TTC TTA GAT GAA GTA ACT TGT AGG 
Asp He He Lys Gly Ala Phe Lys Asp Glu Val Thr Cys Arg 
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Two genomic clones containing the meg-CSF 
sequences identified above have been deposited with the 
American Type Culture Collection, 12301 Parklawn Drive, 
Rockville, Maryland 20852, USA in accordance with the 
requirements of the Budapest Treaty on the International 
Recognition of the Deposit of Microorganisms for the 
Purposes of Patent Procedure on August 3, 19 90. An 
approximately 12 kb genomic fragment (referred to as Meg 
Kpn-SnaBI) spanning the 5 1 Kpnl site to the 3* SnaBI 
site (see Fig. 1) in an E. coli plasmid was given the 

accession number ATCC . The entire 18.2 kb NotI 

sequence of Fig. 1 (referred to as 18-5665) inserted 
into bacteriophage lambda DNA was deposited under the 
accession number ATCC . 

Preliminary experiments have shown that the 
KpnI/SnaBI fragment appears to produce an active protein 
in COS cells. 

The nucleotide sequence of this meg-CSF 
sequence, specifically Exon I and Exon II, has been 
compared with the nucleotide sequences recorded in 
protein and DNA databanks. The amino terminus of 
vitronectin, the serum adhesion molecule, was observed 
tro hasrs the highest degree of sequence similarity-r T?he 
amino terminal portion is also called Somatomedin B, a 
peptide found in the circulation. Other significant 
sequence similarities were found at the protein and DNA 
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levels to the B domain of von Willebrand factor, made by 
endothelial cells and megakaryocytes and at the protein 
level in the extracellular domain of murine PC-1, a 
membrane glycoprotein dimer found on IgG secreting 
5 plasma cells. These peptides are functionally 

unrelated, with Somatomedin B and PC-1 having unknown 
functions . 

To obtain the entire genomic and cDNA 
sequences the amino and carboxy termini i of meg-CSF may 

10 be determined by various procedures* One procedure 
involves the preparation of a cDNA library from 
activated human peripheral blood leucocytes or other 
sources of meg-CSF RNA and extraction of the full length 
cDNA by hybridization, using the three known Exons as 

15 probes. A second method is expression cloning in COS 
cells. For example, different sections of the 18.2 kb 
genomic insert are subcloned into COS cells and 
different activities identified. If a protein is 
located, the RNA of that clone is isolated and cDNA 

20 prepared therefrom and expressed to obtain the protein. 

For example, this method involves subcloning either the 
full length human genomic clone (referred to as 18- 
5665)7:the KpnI/SnaBI clone, or~different sections of 
the 18.2 kb genomic insert into an expression vector, 

25 transfecting into COS cells, preparing a cDNA library 
from meg-CSF transfected COS cells and screening by 
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hybridization for meg-CSF cDNA. Alternatively, the 
entire sequence, including the N-terrainal Met, may be 
identified by comparison with the murine homolog of meg- 
CSF. Also, the mRNA from a cell source of meg-CSF can 
be used to make a cDNA library which can be screened 
with the probes to identify the cDNAs encoding the meg- 
CSF polypeptide. Techniques to screen for cDNA sources 
include making primers from Exons I-III and employing 
PCR techniques to isolate and amplify cDJSTA transcripts. 

The full-length human genomic clone or 
fragments thereof may also be employed as probes to 
isolate by cross-hybridization the murine genomic 
equivalent of meg-CSF. The murine genomic clone of meg- 
CSF or fragments thereof can be used to identify a mRNA 
source for meg-CSF which can be used to make a murine 
cDNA library. The murine cDNA can be used to identify 
the corresponding Exons in the human meg-CSF gene which 
can then be spliced together to create a full length 
human cDNA. 

Once the entire cDNA is identified, it or any 
portion of it that encodes an active fragment of meg- 
CSF, can be introduced into any one of a variety of 
expression vectors to make an expression- system for meg- 
CSF or one or more fragments thereof. 
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By such use of recombinant techniques, DNA 
sequences encoding the meg-CSF polypeptide are obtained 
which contain DNA sequences encoding one or more of the 
tryptic fragments or the partial sequence identified 
5 above. The present invention also encompasses these DNA 
sequences, free of association with DNA sequences 
encoding other proteins, and coding on expression for 
meg-CSF polypeptides. These DNA sequences include those 
sequences encoding all or a fragment of the above- 

10 identified peptide sequences or partial clone sequence 

and those sequences . which hybridize under stringent 
hybridization conditions [see, T. Maniatis et al, 
Molecular Cloning (A Laboratory Manual! . Cold Spring 
Harbor Laboratory (1982), pages 387 to 389] to the DNA 

15 sequences . 

Jin example of one such stringent hybridization 
condition is hybridization in 4XSSC at 65°C, followed by 
a washing in 0.1XSSC at 65 °C for an hour. Alternatively 
an exemplary stringent hybridization condition is in 50% 

20 formamide, 4XS SC ^t 42 °C. 

DNA sequences which hybridize to the sequences 
for meg-CSF' under relaxed hybridization conditions and 
which code opi expression for meg-CSF peptides having - 
meg-CSF biological properties also encode novel meg-CSF 

25 polypeptides. Examples of such non-stringent 

hybridization conditions are 4XSSC at 50 °C or 
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hybridization with 3 0-40% formamide at 42 °C. For 
example, a DNA sequence which shares regions of 
significant homology, e.g., sites of glycosylation or 
disulfide linkages, with the sequences of meg-CSF and 
encodes a protein having one or more meg-CSF biological 
properties clearly encodes a meg-CSF polypeptide even if 
such a DNA sequence would not stringently hybridize to 
the meg-CSF sequences. 

Allelic variations (naturally-occurring base 
changes in the species population which may or may not 
result in an amino acid change) of DNA sequences 
encoding the peptide sequences of meg-CSF are also 
included in the present invention, as well as analogs or 
derivatives thereof. Similarly, DNA sequences which 
code for meg-CSF polypeptides but which differ in codon 
sequence due to the degeneracies of the genetic code or 
variations in the DNA sequence of meg-CSF which are 
caused by point mutations or by induced modifications to 
enhance the activity, half-life or production of the 
polypeptides- encoded thereby are also encompassed in the 
invention. 

meg-CSF polypeptides may also be produced by 
Jkaown c on veTTc i o nal^-ch em i ca 1 synthesis. Methods for 
constructing the polypeptides of the present invention 
by synthetic means are known to those of skill in the 
art. The synthetically-constructed meg-CSF polypeptide 
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sequences, by virtue of sharing primary, secondary, or 
tertiary structural and conformational characteristics 
with meg-CSF polypeptides may possess meg-CSF biological 
properties in common therewith. Thus, they may be 
employed as biologically active or immunological 
substitutes for natural, purified meg-CSF polypeptides 
in therapeutic and immunological processes. 

Modifications in the peptides or DNA 
sequences encoding meg-CSF can be made by one skilled 
in the art using known techniques. Modifications of 
interest in the meg-CSF sequences may include the 
replacement, insertion or deletion of a selected amino 
acid residue in the coding sequences. Mutagenic 
techniques for such replacement, insertion or deletion 
are well known to one skilled in the art. [See, e.g., 
United States patent 4 , 518 , 584 . ] 

Specific mutations of the sequences of the 
meg-CSF polypeptide may involve modifications of a 
glycosylation site. The absence of glycosylation or 
only partial glycosylation results from amino acid 
substitution or deletion at any asparagine- linked 
glycosylation recognition site or at any site of the 
molecule that is modified by addition of 0-linked 
carbohydrate. An asparagine- linked glycosylation 
recognition site comprises a tripeptide sequence which 
is specifically recognized by appropriate cellular 
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glycosylation enzymes. These tripeptide sequences are 
either Asp-X-Thr or Asp-X-Ser, where X can be any amino 
acid. A variety of amino acid substitutions or 
deletions at one or both of the first or third amino 
acid positions of a glycosylation recognition site 
(and/or amino acid deletion at the second position) 
results in non-glycosylation at the modified tripeptide 
sequence. Expression of such altered nucleotide 
sequences produces variants which are not glycosylated 
at that site. 

Other analogs and derivatives of the sequence 
of meg-CSF which would be expected to retain meg-CSF 
activity in whole or in part may also be easily made by 
one of skill in the art given the disclosures herein. 
One such modification may be the attachment of 
polyethylene glycol (PEG) onto existing lysine residues 
in the meg-CSF sequence or the insertion of one or more 
lysine residues or other amino acid residues that can 
react with PEG or PEG derivatives into the sequence by 
conventional techniques to enable the attachment of PEG 
moieties. Such modifications are believed to be 
encompassed by this invention. 

n The present invention - also provides s method - 

for producing meg-CSF polypeptides or active fragments 
thereof. One method of the present invention involves 
introducing the cDNA encoding a meg-CSF polypeptide into 
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an expression vector to make an expression system for 
meg-CSF. A selected host cell is transformed with the 
vector and cultured. The method of this present 
invention therefore comprises culturing a suitable cell 
or cell line, which has been transformed with a DNA 
sequence coding on expression for a meg-CSF polypeptide 
under the control of known regulatory sequences. 
Regulatory sequences include promoter fragments, 
terminator fragments and other suitable sequences which 
direct the expression of the protein in an appropriate 
host cell. The expressed factor is then recovered, 
isolated and purified from the culture medium (or from 
the cell, if expressed intracellularly) by appropriate 
means known to one of skill in the art. 

Suitable cells or cell lines may be 
mammalian cells, such as Chinese hamster ovary cells 
(CHO) or 3T3 cells. The selection of suitable mammalian 
host cells and methods for transformation, culture, 
amplification, screening and product production and 
purification are known in the art. See, e.g., Gething 
and Sambrook, Nature, 293:620-625 (1981), or 
alternatively, Kaufman et al, Mol. Cell. Biol. , 
5(7) :1750-1759 (1985) or Howley"et al,lj. S. Patent 
4,419,446. Other suitable mammalian cell lines, are the 
monkey COS-l cell line, and the CV-1 cell line. Further 
exemplary mammalian host cells include particularly 
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primate cell lines and rodent cell lines, including 
transformed cell lines. Normal diploid cells, cell 
strains derived from in vitro culture of primary tissue, 
as well as primary explants, are also suitable. 
Candidate cells may be genotypically deficient in the 
selection gene, or may contain a dominantly acting 
selection gene. Other suitable mammalian cell lines 
include but are not limited to, HeLa, mouse L-929 cells, 
3T3 lines derived from Swiss, Balb-c or NIH mice, BHK or 
HaK hamster cell lines. 

Similarly, useful as host cells suitable for 
the present invention are bacterial cells. For example, 
the various strains of L, coli (e.g., HB101, MC1061 and 
strains used in the following examples) are well-known 
as host cells in the field of biotechnology. Various 
strains of B^. subtilis. Pseudomonas . other bacilli and 

the like may also be employed in this method. 

Many strains of yeast cells known to those 

skilled in the art are also available as host cells for 

expression of the polypeptides of the present invention. 

Additionally, where desired, insect cells may be 

utilized as host cells in the method of the present 
-invention. See— e. g. -Miller et-^l, Genetic Enairieer-i ng r 

8:277-298 (Plenum Press 1986) and references cited 

therein. 
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The present invention also provides 
recombinant molecules or vectors for use in the method 
of expression of novel meg-CSF polypeptides ♦ These 
vectors contain the novel meg-CSF DNA sequences recited 

5 herein, and which alone or in combination with other 

sequences code for meg-CSF polypeptides of the invention 
or active fragments thereof. Alternatively, vectors 
incorporating modified sequences as described above are 
also embodiments of the present invention and useful in 

.0 the production of meg-CSF polypeptides. The vector 

employed in the ^method also contains selected regulatory 
sequences in operative association with the DNA coding 
sequences of the invention and capable of directing the 
replication and expression thereof in selected host 

5 cells. 

One desirable vector is pXM [Y. C. Yang et al, 
Cell , 47:3-10 (1986)]. Mammalian cell expression 
vectors described herein may be synthesized by 
techniques well known to those skilled in this art. The 

0 components of the vectors, e.g. repl icons, selection 
genes, enhancers, promoters, and the like, may be 
obtained from natural sources or synthesized by known 
procedures. Sea, Kaufman et al, J.~Mol. Biol. . 159:511- 
521 (1982); and Kaufman, Proc. Natl, Acad. Sci. r USA r 

5 82:689-693 (1985). Alternatively, the vector DNA may 

include all or part of the bovine papilloma virus genome 
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[Lusky et al, Cell, 36:391-401 (1984)] and be carried in 
cell lines such as C127 mouse cells as a stable episomal 
element. The transformation of these vectors into 
appropriate host cells can result in expression of the 

5 meg-CSF polypeptides. 

Other appropriate expression vectors of which 
numerous types are known in the art for mammalian, 
insect, yeast, fungal and bacterial expression can also 
be used for this purpose. 

0 Thus meg-CSF or active fragments thereof, 

purified to homogeneity from cell sources or produced 
recombinantly or synthetically, may be used in a 
pharmaceutical preparation or formulation to stimulate 
platelet recovery following chemotherapy or bone marrow 

5 transplantation, to treat thrombocytopenia, aplastic 
anemia and other platelet disorders. Therapeutic 
treatment of such platelet disorders or deficiencies 
with these meg-CSF polypeptide compositions may avoid 
undesirable side effects caused by treatment with 

) presently available serum-derived factors or 

transfusions of human platelets. It may also be 
possible to employ one or more peptide fragments of meg- 
CSS. such as the peptides above-identified, in- such 
pharmaceutical formulations. 
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The polypeptides of the present invention may 
also be employed, alone or in combination with other 
cytokines, hematopoietins, interleukins, growth factors 
or antibodies in the treatment of the above- identified 
5 conditions . 

Therefore, as yet another aspect of the 
invention are therapeutic compositions for treating the 
conditions referred to above. Such compositions 
comprise a therapeutically effective amount of the meg- 

10 CSF protein or a therapeutically effective fragment 

thereof in admixture with, a pharmaceutical ly acceptable 
carrier- This composition can be systematically 
administered parenterally. Alternatively, the 
composition may be administered intravenously. If 

15 desirable, the composition may be administered 

subcutaneously . When systematically administered, the 
therapeutic composition for use in this invention is in 
the form of a pyrogen- free, parenterally acceptable 
aqueous solution. The preparation of such 

2 0 pharmaceutical ly acceptable protein solutions, having 
due regard to pH, isotonicity, stability and the like, 
is within the skill of the art. 

~~ The dosage regimen involved in a method for 
treating the above-described conditions will be 

25 determined by the attending physician considering 
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various factors which modify the action of drugs, e.g. 
the condition, body weight, sex and diet of the patient, 
the severity of any infection, time of administration 
and other clinical factors. Generally, the daily 
regimen should be in the range of 1-10O0 micrograms of 
meg-CSF protein or fragment: thereof or 50 to 5000 units 
(i.e., one unit being the minimum concentration of meg- 
CSF protein which yields the maximal number of colonies 
in the murine fibrin clot megakaryocyte colony formation 
assay) of protein per kilogram of body weight. 

The therapeutic method, compositions and 
polypeptides of the present invention may also be 
employed, alone or in combination with other cytokines, 
hematopoietic, interleukins, growth factors or 
antibodies in the treatment of disease states 
characterized by other symptoms as well as platelet 
deficiencies. It is anticipated that this molecule, if 
it does not itself have TPO activity, will prove useful 
in treating some forms of thrombocytopenia in 
combination with general stimulators of hematopoiesis, 
such as IL-3, IL-6 or GM-CSF or with other 
megakaryocytic stimulatory factors or molecules with 
TPO-like activity.- Additional exemplary cytokines or^ 
hematopoietic for such co-administration include TPO, 
G-CSF, CSF-1, GM-CSF, IL-1, IL-11 (described as IL-10 in 
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co-owned copending U. s. patent application SN 
07/441,100 incorporated herein by reference), IL-3, IL- 
4 f M-CSF, IL-7 or erythropoietin. The dosage recited 
above would be adjusted to compensate for such 
additional components in the therapeutic composition. 
Progress of the treated patient can be monitored by 
conventional methods . 

Other uses for these novel polypeptides are in 
the development of antibodies generated by standard 
methods for in vivo or in vitro diagnostic or 
therapeutic use. Such antibodies may include both 
monoclonal and polyclonal antibodies, as well as 
chimeric antibodies or "recombinant" antibodies 
generated by known techniques. Also provided by this 
invention are the cell lines generated by presenting 
meg-CSF or a fragment thereof as an antigen to a 
selected mammal , followed by fusing cells of the animal 
with certain cancer cells to create immortalized cell 
lines by known techniques. The methods employed to 
generate such cell lines and antibodies directed against 
all or portions of a human meg-CSF polypeptide of the 
present invention are also encompassed by this 
invention. — ~~ 
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The antibodies of the present invention may be 
utilized for in vivo and in vitro diagnostic purposes, 
such as by associating the antibodies with detectable 
labels or label systems. Alternatively these antibodies 
may be employed for in vivo and in vitro therapeutic 
purposes, such as by association with certain toxic or 
therapeutic compounds or moieties known to those of 
skill in this art. These antibodies also have utility 
as research reagents. 

The following examples illustratively describe 
the purification and characteristics of homogeneous 
human meg-CSF and other methods and products of the 
present invention. These examples are for illustration 
and do not limit the scope of the present invention. 

Example 1 - Pur ification of mea-CSF from Urine 

The following procedures are presently 
employed to obtain homogeneous meg-CSF protein from 
urine of human bone marrow transplant patients. Urine 
from patients with aplastic anemia or thrombocytopenia 
accompanying other disease states may also be used as 
the source of the factor employing this purification. 

— — STEP trt Urina was collected fiom the bone 
marrow transplant patients between days 5 and 18 
following transplant* One hundred liters of pooled 
urine were treated with protease inhibitors 
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phenylmethyl- sulfonyl fluoride (PMSF) and 
ethylenediaminetetraacetic acid (EDTA) . This pooled 
urine was concentrated on an Amicon YM-10 filter (10,000 
molecular weight cut-off) to remove excess pigments and 
5 reduce the volume. A cocktail of protease inhibitors 
(leupeptin, aprotinin, ethylene glycol-bis-tetraacetic 
acid (EGTA) and N-ethylmaleimide (NEM) ) was added to the 
urine at this and the next three steps to minimize 
proteolysis. The pH of the urine concentrate was 

10 adjusted to 8.0 and diluted to a conductivity of 7mS/cm. 

STEP 2: The retentate from this first step of 
the purification was then subjected to anion exchange 
column chromatography on a QAE Zetaprep [Cuno] at pH 
8.0. The QAE flow-through was adjusted to a pH4.5 with 

15 1M acetic acid. 

STEP 3: The flow-through from the second 
purification step was bound to a cation exchange 
chromatographic column, an SP-Zetaprep column [Cuno] at 
pH 4.5. Bound protein containing meg-CSF was eluted 

20 with 1M NaCl at a pH of 4.5. The eluate was pooled, 

protease inhibitors were added as above and the 
materials stored at -80 °C until further chromatography 
was performed. The eluate was then dialyzed against 
Tris-buffered saline (TBS) , with the addition of the 
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protease inhibitors described in Step 1. This dialyzate 
was heated at 56 'C for 30 minutes. Addition of the 
protease inhibitors, while not essential for recovery of 
protein, enabled greater amount of protein to be 
recovered from this step, undegraded by the proteases in 
the system. Pools from this step were also analyzed for 
the presence of megakaryocyte-specif ic growth factors. 
These pools were found to contain meg-CSF activity. 

STEP 4: The resulting material was added to a 
lectin affinity chromatographic column, a Wheat Germ 
Sepharose column [Pharmacia] and eluted with 0.25 M N- 
acetyl glucosamine (N-acglcNH 2 ) in TBS. Urinary meg-CSF 
was found to bind to this column. The bound protein was 
eluted from this column by 20 mM sodium acetate, pH 4.5 
in the presence of the protease inhibitors of step l, 
which were added for the reasons described in Step 3. 

STEP 5: This dialysate was applied to a 10 ml 
S-Toyopearl FPLC cation exchange column and eluted using 
a linear gradient of 0 to 1M NaCl in 20mM sodium acetate 
at pH 4.5. The protein eluted from this step was tested 
for meg-CSF activity in the fibrin clot assay described 
below. The meg-CSF activity was observed to elute in 
two discrete peaks. The major -activity eluted between - 
0.1M and 0.25M NaCl. A minor, but reproducible activity 
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eluteci between 0.3M and 0.5N NaCl. The two activities 
may be due to protein or carbohydrate modification of a 
single protein; however the data presented further 
herein refers -to the major protein . 
5 STEP 6: The eluate from this fifth 

purification step was then purified on a reverse phase 
HPLC (C4) column [Vydac; lcmX25cm] which was eluted with 
a linear gradient of between 23-33% acetonitrile in 
0.1% trifluoroacetic acid (TFA) . This step removes an 

10 abundant 30Kd protein contaminant. 

STEP 7 : The HPLC step was repeated in a 
different solvent system, after the eluate of Step 6 was 
diluted with two parts acetic acid and pyridine. The 
purified material eluted between 6-15% n-propanol in 

15 pyridine and acetic acid on a C18 reverse phase HPLC 

column (0.46 X 25 cm). The material produced after this 
step, when assayed gave the specific activity of greater 
than 5 X 10 7 dilution units reported in the murine assay. 
This optional step removes the bulk of urinary 

20 ribonuclease, a major contaminant, from the preparation. 

STEP 8: The HPLC step was repeated once more 
on a C4 coltimn (Vydac; 0.46 X 25 cm) using 0.15% HFBA in 
acetonitrile. The material eluted between 27-37%""" 
acetonitrile. The last HPLC step removed substantially 

25 all remaining ribonuclease and proteinaceous 
contaminants present after Step 7. 
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This purified meg-CSF material was then 
analyzed by SDS-PAGE, bioassayed and labelled with 125 I. 
Homogenous protein is obtained from this procedure, 
omitting step 7, having a specific activity ranging from 
about 5X10 r to about 2-5X10 8 dilution units per mg 
protein in the murine megakaryocyte colony assay 
described below • 

Example 2 - Sod ium Dodecvl Sulfate-Polyacrvlamide Gel 
Elect r opho r es i s 

The purified protein was analyzed by SDS-PAGE 
performed according to the method of Laemmli [Laemmli, 
U. K. , Nature, 227:680-685 (1970)] on 12% acrylamide 
slab gels (0*75 mm thickness). After electrophoresis, 
the gels were either subjected to autoradiography to 
visualize 125 I-labelled meg-CSF, or silver stain, or cut 
into 0.5-1 cm slices and eluted in 0.5 ml TBS with 0.3% 
deionized BSA overnight at 4°C and assayed for meg-CSF 
activity. Apparent molecular weight was determined with 
protein standards: BRL prestained molecular weight 
markers, 14C molecular weight standards [NEN], or low 
molecular weight SDS-PAGE standards [Biorad]. 

A small aliquot of protein from Steps 6, 1 and 
8 of Example 1 containing active meg-CSF was iodinated 
and subjected to SDS-PAGE. SDS-PAGE analysis (non- 
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reducing conditions) of reverse phase purified meg-CSF 
from step 8 beginning with several fractions which 
eluted before the meg-CSF activity, continuing right 
through the active fractions and ending with fractions 
5 which eluted after the peak of meg-CSF activity, 

revealed the presence of one heterogenous protein band 
ranging in size between 28 and 38 kd. Elution of the 
protein from a parallel gel lane revealed that the 
bioactivity in the murine megakaryocyte colony formation 

10 assay correlated with the presence of the iodinated meg- 

CSF band in the gel- 

Upon reduction, the majority of the protein 
has a molecular weight of between approximately 20-27kd. 
Based on this information meg-CSF may be a dimer. The 

15 protein does not appear to be digestable with N- 

glycanase under standard conditions. 

Example 3 - Recovery of Protein 

Starting with 50 liters of urine, the final 
pooled active fractions from the HPLC column contained 
20 approximately 25 micrograms of protein, estimated from 
the amino acid composition of purified meg-CSF. The 
specific activity of the 28-38 kd meg-CSF protein was 
estimated to be greater than approximately 5 X 10 7 
dilution units/mg in the murine fibrin clot assay 
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described below. One unit of activity is defined as the 
reciprocal of the maximal dilution which stimulates 
maximal colony formation. One megakaryocyte colony is 
defined as 3 or more cells. 

Example 4 - meq-CSF Protein Composition 

meg-CSF obtained. from the eighth step of the 
purification of Example 1, omitting Step 7, was employed 
to obtain tryptic fragments for sequencing. Twenty-five 
micrograms of purified meg-CSF preparation were desalted 
over a reverse phase column. The main peak was then 
fully reduced and alkylated, due to the large number of 
cysteines present therein. This material was again 
eluted through a reverse phase column, and the protein 
peak subjected to SDS-PAGE. Using I 125 labelled material 
as a marker, protein running at approximately 20-27 kd 
was excised from the gel, fixed with methanol : acetic 
acid: water, and rinsed with water. The gel slice was 
macerated. Following neutralization with 0.1M NH,HC0, 

4 3 

solution (200 fil) , the protein contained within the gel 
matrix was then digested with trypsin (2% w/w) . 

Sequencing provided the four peptide 
-Sequences: . "— - ~~ 

(a) Ser Arg Cys Phe Glu Ser Phe Glu Arg 

(b) Arg Val Cys Thr Ala Glu Leu Ser Cys Lys Gly (Arg) 
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(c) Lys Ala Pro Pro Pro $0 Gly Ala Ser Gin Thr lie Lys 

(d) Lys Tyr Asp Lys Cyslfs Pro Asp Thr Glu Ser Phe Cys 

Ala Glu Val His AsnPro 
X represents an ambiguously identified amino acid, which 
is an S or T. () represets a tentatively identified 
amino acid. 

All four of the© tryptic peptides are found 
in the Exons and putativeeDNA sequence of meg-CSF. 

Example 5 - Genomic Clonisr of Urinary meg-CSF. 

Probes consisting of pools of oligonucleotides 
or unique oligonucleotide are designed from the tryptic 
sequences abbve accordingto the method of R. Lathe, 
Mol. Biol. , 183(1): 1-12 (£85) . The following 
oligonucleotide probes arc synthesized on an automated 
DNA synthesizer, with N presenting any of the four 
nucleotides A, T, C, or <&R representing the 
nucleotides A or G; Y repEsenting the nucleotides C or 
T; and H representing theaucleotides A, T, or C: 

(1) tgyttygarthttyga 

(2 ) tgyttygamszttyga 
( 3 ) gthtgyacnctgaryt 

~ (4) aargcnccncsccn" _ ~ 

(5) gcnagycarahathaa 

( 6 ) gcntcncarasathaa 
( 7 ) aartaygaysrtgytg 



WO 91/02001 



PCT/US90/04421 



69 

(8) GCNGARGTNCAYAAYCC 

( 9 ) AAGTATGACAAGTGCTGCCCTGATGAGTCCTTCTGTGCTGA 
GGTGCACAACCCC; and 

( 10 ) AAGTATGACAAGTGCTGCCCTGATGAGAGCTTCTGTGCTGA 
GGTGCACAACCC 

Because the genetic code is degenerate (more 
than one codon can code for the same amino acid) a 
mixture of oligonucleotides are synthesized that contain 
all possible nucleotide sequences encoding the amino 
acid sequence of the selected tryptic fragment or 
portion thereof, it may be possible in some cases to 
reduce the number of oligonucleotides in the probe 
mixture based on codon usage because some codons are 
rarely used in eukaryotic genes, and because of the 
relative infrequency of the dinucleotide CpG in 
eukaryotic coding sequences [see J. J. Toole et al, 
Nature, 312:342-347 (1984)]. The regions of the amino 
acid sequences used for probe design are chosen by 
avoiding highly degenerate codons where possible. The 
oligonucleotides are synthesized on an automated DNA 
synthesizer and the probes are then radioactively 
labelled with polynucleotide kinase and 32 P-ATP. 

The "aegenerate oligeriTicleot4:de~probeK-^re then 
used to screen a human genomic library prepared from 
placenta [Stratagene Cloning Systems, La Jolla, CA] 
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using established techniques [See K. Jacobs et al, 
Nature , 313 ; 806^810 (1985)]. Recombinants from this 
library are plated and duplicate nitrocellulose and/or 
nylon replicas are made of the plates. Typically, the 
5 oligonucleotides are kinased with 32 P gamma ATP and 

hybridized to the filters at 48 °C in 3M TMAC solution 
for 4 8 -9 £ hours. The filters are then washed in 3M TMAC 
in 50 mM Tria at 50 °C for 1 hour followed by two washes 
at room temperature for 30 minutes each in 2 x SSC [See 

10 K. Jacobs et al, Nucleic Acids Res. , 16:4637-4650 

(1988)]. Duplicate positives are plaque purified. 

An 18.2 kb genomic fragment (Fig* I) was shown 
to hybridize to all four tryptic sequences. The first 
partial genomic subclone contained within the 18 . 2 kb 

15 insert to be isolated and sequenced was shown to contain 

one tryptic sequence (a) in a single Exon (Exon II) and 
two partial tryptic sequences (b and d) which are 
contained in Exon II and overlap with adjacent coding 
sequence* Exon III was identified with a probe made 

20 from tryptic (c) . Exon I was identified with a probe 
made to the 5 f end of tryptic (b) . 

The predicted cDNA and predicted amino acid 
sequences of these combined partial genomic clones~are 
reported in Table III above, which encodes a portion of 

25 the meg-CSF protein. While the peptide encoded by this 

partial sequence may produce an active meg-CSF fragment, 
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the sequence lacks the amino terminal Met as well as any 
defined 3 • border. To obtain the remainder of the meg- 
CSF genomic sequence, the full length genomic DNA 
sequence may be expressed in COS cells and a cDNA 
library prepared from COS cell RNA, and the cDNA 
sequence cloned from that source. Alternatively, the 
remainder of the sequence may be deduced by sequence 
comparison with a cross-hybridizing murine genomic meg- 
CSF sequence. The sequence may also be obtained from 
peripheral blood lymphocytes or placenta, two potential 
sources of the mRNA- 

Example 6 - — Expressi on of Recombinant Human meg-rfiy 

To produce meg-CSF or an active fragment 
thereof, the cDNA encoding it is transferred into an 
appropriate expression vector, of which numerous types 
are known in the art for human, insect, yeast, fungal 
and bacterial expression, by standard molecular biology 
techniques. One such vector for mammalian cells is pXM 
[Y. C. Yang et al, Cell, iZ:3-10 (1986)]. This vector 
contains the SV40 origin of replication and enhancer, 
the adenovirus major late promoter, a cDNA copy of the 
•adenoviruB~tripa^ite leader sequence, a small hybrid 
intervening sequence, an SV40 polyadenylation signal and 
the adenovirus VA I gene, in appropriate relationships 
to direct the high level expression of the desired cDNA 
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in mammalian cells [See, e.g., Kaufman, Proc. Natl. 
Acad. Sci. USA , 82:689-693 (1985)]. The pXM vector is 
linearized with the endonuclease enzyme Xhol and 
subsequently ligated in equimolar amount separately to 
5 the cDNA encoding meg-CSF modified by addition of 
synthetic oligonucleotides [Collaborative Research, 
Lexington, MA] that generate Xho I complementary ends to 
generate constructs for expression of meg-CSF. 

Another vector which may be employed to 

10 express meg-CSF in CHO cells is pEMC2Bl. This vector 

may be derived from.pMT2pc which has been deposited with 
the American Type Culture Collection (ATCC), Rockville, 
MD (USA) und^r Accession Number ATCC 40348 . The DNA is 
linearized by digestion of the plasmid with PstI . The 

15 DNA is then blunted using T A DNA polymerase. An 

oligonucleotide 5 1 TGCAGGCGAGCCTGAA TTCCTCGA 3' is then 
ligated into the DNA, recreating the PstI site at the 5 1 
end and adding an EcoRI site and Xhol site before the 
ATG of the DHFR cDNA. This plasmid is called pMT21, 

20 pMT21 is cut with EcoRI and Xhol which cleaves the 

plasmid at two adjacent cloning sites. An EMCV fragment 
of 508 base pairs was cut from pMTgECAT, [S . K . Jong et 
air J, Virol. . 63:1651-1660 (1989)] with the restriction 
enzymes EcoRI and Taqal. A pair of oligonucleotides 68 

25 nucleotides in length were synthesized to duplicate the 
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EMCV sequence up to the AS. The ATG was changed to an 
ATT, and a C is added, crating a Xhol site at the 3' 
end. A Taqal site is sitnted at the 5' end. The 
sequences of the oligonuclotides were: 

5 • CGAGGTTAAAAAACGTCTAGGOCCCCGAACCACGGGGACGTGGTrTTCCTTT 
GAAAAACACGATTGC 3' and itscomplementary strand. 

Ligation of the3|MT21 EcoRI-to-XhoI fragment 
to the EMCV EcoRI -to-Taqalfragment and to the 
Taqal/Xhol oligonucleotides produced the vector pEMC2Bl. 
This vector contains the 240 origin of replication and 
enhancer, the adenovirus nfjor late promoter, a cDNA 
copy of the majority of tk adenovirus tripartite leader 
sequence, a small hybrid itervening sequence, an SV40 
polyadenylation signal andthe adenovirus VA I gene, 
DHFR and ^-lactamase markos and an EMC sequence, in 
appropriate relationships ib direct the high level 
expression of the desiredaBNA in mammalian cells. The 
EMC2B1 vector is lineariz* with the endonuclease enzyme 
EcoRI and subsequently lifted in equimolar amount 
separately to the cDNA enading meg-CSF that was 
previously modified by addtion of synthetic 
.^oligonucleotides that geneaie EcoRI complementary _ends— 
to generate constructs forexpression. 
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The desired vector containing meg-CSF is then 
introduced into appropriate host cells by conventional 
genetic engineering techniques. The transformed cells 
are cultured' and the expressed meg-CSF is recovered and 
5 purified from the culture medium using standard 
techniques. 

A. Mammalian Cell Expression 

To obtain expression of the meg-CSF 
polypeptide in mammalian host cells, the pXM vector 
10 containing the meg-CSF DNA sequence is transfected onto 

COS cells. The conditioned medium from the transfected 
COS cells contains meg-CSF biological activity as 
measured in the murine assays. Similarly the pEMC2Bl 
construct containing the cDNA for meg-CSF is transfected 
15 into CHO cells. 

The mammalian cell expression vectors 
described herein may be synthesized by techniques well 
known to those skilled in this art. One skilled in the 
art can also construct other mammalian expression 

20 vectors comparable to the pXM vector by, e.g., inserting 

the DNA sequence of the meg-CSF from the plasmid with 
appropriate enzymes and employing well-known recombinant 
genetic engineering techniques and other known vectors, 
such as pJL3 and pJL4 [Gough et al., EMBQ J . . 4:645-653 

25 (1985)] and pMT2 (starting with pMT2-VWF, ATCC #67122; 

see PCT application PCT/US87/00033) . 
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Mammalian host cells other than COS cells 
may also be employed in meg-CSF expression. For 
example, preferably for stable integration of the vector 
DNA, and for subsequent amplification of the integrated 
vector DNA, both by conventional methods, CHO cells may 
be employed as a mammalian host cell of choice. 

Once the vectors and host cells are 
selected and transformed, stable transformants are then 
screened for expression of the product by standard 
immunological, biological or enzymatic assays, such as 
those described below in Example 8. The presence of the 
DNA and inRNA encoding the meg-CSF polypeptides may be 
detected by standard procedures such as Southern and 
Northern blotting. Transient expression of the DNA 
encoding the polypeptides during the several days after 
introduction of the expression vector DNA into suitable 
host cells is measured without selection by activity or 
immunologic assay, e.g., the murine fibrin clot assay, 
of the proteins in the culture medium. 

B. Bacterial Expression Systems 

Similarly, one skilled in the art could 
manipulate_the sequences jsftcoding the meg-GSF 
polypeptide by eliminating any human regulatory 
sequences flanking the coding sequences and inserting 
bacterial regulatory sequences to create bacterial 
vectors for intracellular or extracellular expression of 
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the meg-CSF polypeptide of the invention by bacterial 
cells. The DNA encoding the polypeptides may be further 
modified to contain different codons to optimize 
bacterial expression as is known in the art. Preferably 
5 the sequences encoding the mature meg-CSF are 

operatively linked in-frame to nucleotide sequences 
encoding a secretory leader polypeptide permitting 
bacterial expression, secretion and processing of the 
mature meg-CSF polypeptides, also by methods known in 

10 the art. The expression of meg-CSF in JL_ coli using 
such secretion systems is expected to result in the 
secretion of the active, polypeptide. This approach has 
yielded active chimeric antibody fragments [See, e.g., 
Bitter et al, Science, 210:1041-1043 (1983)]. 

15 Alternatively, the meg-CSF may be expressed as a 

cytoplasmic protein in E. coli . In this case, the 
molecule would most likely have to be refolded after 
complete denaturation with guanidine hydrochloride, a 
process also known in the art. For procedures for 

2 0 isolation and refolding of intracellular^ expressed 
proteins, see, for example, U. S. Patent 4,512,922. 

The compounds expressed through, either 
route in bacterial host cells may then be recovered, 
purified, and/or characterized with respect to 

25 physicochemical, biochemical and/or clinical parameters, 
all by known methods. 
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c - Insect or Yeast Cell Expression 

Similar manipulations can be performed 
for the construction of an insect vector for expression 
of meg-CSF polypeptides in insect cells [See, e.g., 
procedures described in published European patent 
application 155,476], 

Similarly yeast vectors are constructed 
employing yeast regulatory sequences to express cDNA 
encoding the precursor, in yeast cells to yield secreted 
extracellular active meg-CSF, Alternatively the 
polypeptide may be expressed intracellular^ in yeast, 
the polypeptide isolated and refolded to yield active 
meg-CSF, [See, e . g procedures described in published 
PCT application WO 86/00639 and European patent 
application EP 123,289.] 

Example 7 - Construc tion of CHO Cell Lines Expressing 

High Levels of meg-CSF 

One method for producing high levels of the 
meg-CSF protein of the invention from mammalian cells 
involves the construction of cells containing multiple 
copies of the cDNA encoding the meg-CSF. 

T3e cDNA is co-trans fected v*i€h an amplifiab±e~ 
marker, e.g., the DHFR gene for which cells containing 
increasing concentrations of methotrexate (MTX) 
according to the procedures of Kaufman and Sharp, 
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Mol. Biol. . (1982) supra . This approach can be employed 
with a number of different cell types. Alternatively, 
the meg-CSF cDNA and drug resistance selection gene 
(e.g., DHFR) may be introduced into the same vector. A 
preferred vector for this approach is pEMC2Bl. 

For example, the pXM vector containing the 
meg-CSF gene in operative association with other plasmid 
sequences enabling expression thereof is introduced into 
DHFR-deficient CHO cells, DUKX-BII, along with a DHFR 
expression plasmid such as pAdD26SVpA3 [Kaufman, Proc. 
Natl. Acad, Sci . USA , 82:689-693 (1985)] by calcium 
phosphate coprecipi tat ion and trans feet ion. 

Alternatively, the pEMC2Bl vector containing 
the meg-CSF gene in operative association with other 
plasmid sequences enabling expression thereof is 
introduced into DHFR-deficient CHO cells, DUKX-BII, by 
protoplast fusion and transfection. The meg-CSF gene 
and DHFR marker gene are both efficiently expressed when 
meg-CSF is introduced into pEMC2Bl. The meg-CSF gene 
may be introduced into pMT2 as previously mentioned and 
the resultant vector used in place of pXM/meg-CSF and 
pAdA26SV(A) 3 . 

DHFR expressing transf ormants "are selected for 
growth in alpha media with dialyzed fetal calf serum. 
Transformants are checked for expression of meg-CSF by 
bioassay, immunoassay or RNA blotting and positive pools 
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are subsequently selected for amplification by growth in 
increasing concentrations of MTX (sequential steps in 
0.02, 0.2, 1.0 and 5uM MTX) as described in Kaufman et 

Mol. Cell Biol, r 5:1750 (1983). The amplified 
lines are cloned, and meg-CSF protein expression is 
monitored by the fibrin clot assay. meg-CSF expression 
is expected to increase with increasing levels of MTX 
resistance. 

In any of the expression systems described 
above, the resulting cell lines can be further 
amplified by appropriate drug selection, resulting cell 
lines recloned and the level of expression assessed 
using the murine fibrin clot assay described above. 

The meg-CSF expressing CHO cell lines can be 
adapted to growth in serum- free medium. Homogeneous 
meg-CSF can be isolated from conditioned medium from the 
cell line using methods familiar in the art, including 
techniques such as lectin-af f inity chromatography, 
reverse phase HPLC, FPLC and the like. 

Example 8 - Bio logical Activities of Human mea-CSF 

The following assays were performed using the 
purifiednneg-CSF described in Example 1. -"^he 
recombinant version of the molecule is expected to 
exhibit meg-CSF biological properties in these same 
assays or othe.r assays. 
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A. Murine Fibrin Clot Assay 

The meg-CSF obtained from Step 7 of the 
purification techniques of Example 1 was tested for 
activity in ttxp megakaryocyte colony formation assay 
5 performed substantially as described in S. Kuriya et al, 
Exp, Hematol, . 15:896-901 (1987). A fibrin clot was 
formed containing 2.5 x 10 5 mouse bone marrow cells in a 
96-well plate. The diluted sample was layered around 
the clot and incubated for 6 days. Thereafter, cells 

10 were fixed and megakaryocytes were stained for 

acetylcholinesterase, a specific marker for murine 
megakaryocytes. A colony was defined as three or more 
megakaryocytes per unit area. Two types of 
megakaryocyte colonies were routinely observed: pure 

15 megakaryocyte colonies containing no additional cell 
types, and mixed megakaryocyte colonies containing 
additional non-megakaryocyte cell types. 

The following control samples were 
included in every assay. A positive control was WEHI 

20 conditioned medium (murine IL-3) , which produced between 
7-25 (average 12) megakaryocyte colonies per clot, 
approximately 50% pure and 50% mixed megakaryocyte 
colonies. Another positive control was serum taken from 
lethally irradiated dogs at the nadir of the platelet 

25 count [see Mazur et al, Exp. Hematol . . 13:1164-1172 

(1985)], which produced between 6-22 (average 15) 
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megakaryocyte colonies per clot, of which approximately 
70% were pure and 3 0% were mixed megakaryocyte colonies. 
The negative control was Iscoves Medium, which produced 
2-4 megakaryocyte colonies per clot. 

In the assay, the meg-CSF has a specific 
activity of greater than approximately 5X10 7 dilution 
units/mg of protein. A unit of activity is defined as 
described in Example 3. 

The major meg-CSF obtained from bone 
marrow transplant urine eluted from the S-Toyopearl 
cation exchange column chromatography step in the 
purification of Examples has been analyzed in this 
assay alone, together, and in combination with other 
cytokines. In the fibrin clot assay, it produced 
between 6-16 (average 13) megakaryocyte colonies, with 
50-70% pure megakaryocyte colonies. 

In each assay the samples were tested in 
duplicate and in three dilutions. 

B - Human Plasma Clot mea-CSF Assay 

The meg-CSF of this invention was also 
tested on an assay for human activity, the plasma clot 
meg-CSF assay described in E. Mazur et al, Blood , 
- 5TT277-2E5 (1981.) with modifications. Non-adherent" "~ 
peripheral blood cells were isolated from Leukopacs and 
frozen in aliquots. The test sample was mixed with 
platelet-poor human AB plasma and 1.25 x 10 5 cells in 24- 
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well plates and allowed to clot by the addition of 
calcium. After a 12 day incubation, megakaryocytes were 
identified using a monoclonal antibody directed against 
platelet glycoproteins Ilb/IIIa and a horseradish 
5 peroxidase/ anti-peroxidase chromogenic detection system. 
Recombinant human IL-3 [Genetics Institute, Inc.] was 
used as a positive control, producing 12-3 0 
megakaryocyte colonies per clot with approximately 60% 
pure and 40% mixed megakaryocyte colonies. As in the 

10 murine assay, the aplastic dog serum was also used as a 

positive control, which produced between 5-10 
megakaryocyte colonies per clot, of which approximately 
50% were pure megakaryocyte colonies contained less than 
10 cells, and 50% were mixed megakaryocyte colonies 

15 containing more than 40 megakaryocytes. The negative 

control was Alpha Medium, which produced 0-1 
megakaryocyte colonies per clot. 

The meg-CSF product from Step 8 of the 
above-described purification scheme may be active in 

20 this assay. 

C. Murine meg-CSF Assay 

An assay was performed on the meg-CSF 
from Step 7 of the purification according to P. J. ~~ 
Quensenberry et al, Blood . 65(1) :214-217 (1985). In the 

25 assay, the meg-CSF stimulates the growth of 
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acetylcholinesterase positive megakaryocyte colonies 
containing on average between 4-15 cells per colony. 
The sizes of the megakaryocytes are variable ranging 
from small immature cells to morphologically large 
mature cells, 

D. Other assays 

Several additional megakaryocyte assays 
using murine bone marrow cells were employed including 
the liquid acetylcholinesterase induction assay of 
Ishibashi et al f Blood , 69:1737-1741 (1987) and the 
liquid serotonin uptake assay of Vanucchi et al, exn. 
Hematol , r 16:916-921 (1988). 

Fractions were also routinely assayed in 
several factor dependent cell lines to screen for the 
presence of growth factors which alone or in combination 
might stimulate colony formation. The cell lines used 
were the human erythroleukemic cell line TF-1, the human 
megakaryoblastic cell line MO-7, the murine 11-6- 
dependent cell line T1165, and the murine IL-3 -dependent 
cell line DA-la. 

The foregoing descriptions detail presently 
preferred embodiments of the invention. Numerous 
r. modifications and variations in practice- oT this- 

invention are expected to occur to those skilled in the 
art. such modifications and variations are encompassed 
within the following claims. 
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WHAT IS CLAIMED IS: 

1. A megakaryocyte colony stimulating factor protein 
substantially free from association with other 
proteinaceous materials. 

2. The protein according to claim 1 comprising all or a 
portion of the same or substantially the same sequence 
of amino acids from amino acid #1 to amino acid #182 of 
Table III, or a biologically active fragment thereof. 

3. The protein according to claim 1 encoded by a DNA 
sequence selected from the group consisting of 

(a) the same or substantially the same DNA 
sequence as in Table I, 

(b) the same or substantially the same DNA 
sequence as in Table II, 

(c) the same or substantially the same DNA 
sequence as in Table III, 

(d) a fragment of the sequences (a) through 

(c) ; 

(e) a DNA sequence capable of hybridizing to 
any ur~(a) -trough (d) . — 
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4. The protein according to claim 1 encoded by a DNA 
sequence characterized by a restriction map of Fig. l. 

5. The protein according to claim 1 encoded by the 
genomic KpnI-SnaBI DNA sequence contained in ATCC 



6- The protein according to claim 1 encoded by ttie 
genomic DNA sequence 18-5665 contained in ATCC 



7. The protein according to claim 1 having the ability 
to stimulate growth and development of colonies 
consisting of intermediate and large sized megakaryocyte 
cells. . 

8. The protein according to claim 1 characterized 
biologically by specific activity in a murine fibrin clot 
megakaryocyte colony formation assay of greater than 
approximately 5X10 7 dilution units/mg protein. 

9. The protein according to claim 1 characterized 
biologically by a specific activity in a murine fibrin 
clot megakaryocyte colony formation assay of 2X10 8 "~ 
dilution units/mg. 



WO 91/02001 



T/US90/04421 



10. The protein according to claim 1 having one or more 
of the following characteristics: 

(1) an apparent molecular weight of 
approximately 28-38 kd as determined by 12% sodium 
dodecyl sulfate polyacrylamide gel electrophoresis under 
non-reducing conditions and by murine fibrin clot 
megakaryocyte colony formation bioassay; 

(2) an apparent molecular weight of 
approximately 2 0-27 kd as determined by 12% sodium 
dodecyl sulfate polyacrylamide gel electrophoresis under 
reducing conditions.; 

(3) the ability to bind SP-Zeta Prep under 
acidic conditions of pH4.5; 

(4) the ability to bind to Wheat Germ- 
Sepharose and Concanavalin-A Sepharose; 

(5) the ability to elute between 23-33% 
acetonitrile on a reverse-phase HPLC (C4) column in a 
solvent of acetonitrile in trif luoroacetic acid; 

(6) the ability to elute between 6-15% n- 
propanol on a reverse-phase HPLC (C18) column in a 
solvent of n-propanol in pyridine and acetic acid; 

(7) the ability to elute between 27-37% 

n acetonitrile on .a reverse-phase HPLC (C4)-eolumn in a 
solvent of heptaf luorobutyric acid in acetonitrile. 
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11. The protein according to claim 1 produced by 
subjecting urine from human bone marrow transplant 
patients to purification comprising the steps of: 

(a) concentrating said urine; 

(b) subjecting the resulting retentate to 
anion exchange column chromatography; 

(c) subjecting the flow- through from step (b) 
to cation exchange column chromatography; 

(d) eluting the material from step (c) through 
lectin affinity column chromatography; 

(e) subjecting the eluate from step (d) to 
cation exchange fine performance liquid chromatography; 

(f) diluting the eluate from step (e) with two 
parts TFA and subjecting it to reverse phase high 
pressure liquid chromatography in a solvent of 
acetonitrile and trif luoroacetic acid; 

(g) diluting the eluate from step (f) with two 
parts pyridine and acetic acid and subjecting it to a 
second reverse phase high pressure liquid 
chromatography in a solvent of n-propanol, pyridine and 
acetic acid; and 

(h) optionally subjecting the eluate from step 
(g) to a third reverse phase high pressure liquid 
chromatography in- a solvent of acetonitrile in 
heptofluorobutyric acid. 
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12. The protein according to claim 1 produced by 
culturing a cell line transformed with a DNA sequence 
encoding expression of meg-CSF in operative association 
with an expression control sequence therefor. 

13. The protein according to claim 1 wherein said DNA 
sequence is selected from the group consisting of 

(a) the same or substantially the same DNA 
sequence as in Table I, 

(b) the same or substantially the same DNA 
sequence as in Table II, 

(c) the same or substantially the same DNA 
sequence as in Table III, 

(d) a fragment of the sequences (a) through 

(c) ; 

(e) a DNA sequence capable of hybridizing to 
any of (a) through (d) . 

14. A process for preparing homogeneous meg-CSF 
comprising subjecting urine from bone marrow transplant 
patients to the purification steps of claim 9, wherein 
said meg-CSF elutes from the latter column as a single 
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15. A process for producing ineg-CSF comprising culturing 
a cell line transformed with a cDNA sequence encoding 
expression of meg-QSF in operative association with an 
expression control sequence therefor. 

16. A DNA sequence coding for meg-CSF selected from the 
group consisting of 

(a) the same or substantially the same DNA 
sequence as in Table I, 

(b) the same or substantially the same DNA 
sequence as in Table II, 

(c) the same or substantially the same DNA 
sequence as in Table III, 

(d) a fragment of the sequences (a) through 

(c) ; 

(e) a DNA sequence capable of hybridizing to 
any of (a) through (d) . 

17. A cell transformed with a DNA sequence of claim 16 
in operative association with an expression control 
sequence . 

18. The cell according to cTaim 17 comprising a 
mammalian or bacterial cell. 
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19. Homogeneous meg-CSF having a specific activity in 
the murine fibrin clot megakaryocyte colony formation 
assay of greater than approximately 5X10 7 dilution units 
per mg polypeptide. 

20. The protein according to claim 19 wherein said 
activity is 2X10 8 dilution units per mg polypeptide. 

21. A megakaryocytopoietic protein produced by the steps 
of 

(a) culturing a cell transformed with a DNA 
sequence comprising nucleotide #1 to #7505 of Table I, a 
fragment thereof or a sequence substantially homologous 
thereto ; and 

(b) recovering, isolating and purifying from 
said culture medium a protein comprising amino acids #1 
to #182 as shown in Table III, a fragment thereof or a 
sequence substantially homologous thereto, said protein 
having the ability to stimulate the growth and 
development of megakarocyte cells. 

22. A pharmaceutical composition comprising a 
therapeutically effective amount of meg-CffF or a fragment; — 
thereof in a pharmaceutical ly effective vehicle. 
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23. The composition according to claim 22 further 
comprising therapeutically effective amounts of an 
additional cytokine, hematopoietin, growth factor or 
thrombopoietin-like factor. 

24. The composition according to claim 23 where said 
cytokine is selected from the group consisting of G-CSF, 
CSF-1, GM-CSF, IL-1, IL-3, 11,-4, erythropoietin, IL-11, 
IL-6, TPO, M-CSF and IL-7. 

25. A method for treating bleeding disorders or platelet 
deficiencies comprising administering to a patient an 
effective amount of meg-CSF or a fragment thereof. 

26. The method according to claim 25 further comprising 
administering simultaneously or sequentially with said 
meg-CSF an effective amount of at least one 
hematopoietin, cytokine, growth factor, thrombopoietin- 
like factor or antibody. 

27. The method according to claim 26 wherein said 
hematopoietin is G-CSF, CSF-1, GM-CSF, IL-1, IL-3, IL-4 , 
IL-11, erythropoietin, IL-6, IL-7 or TPO. 
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